VOLUME 39 


PROCEEDINGS 
OF THE 


NATIONAL ACADEMY 
OF SCIENCES 


OF THE 
UNITED STATES OF AMERICA 


Linus Pauling, Chairman    Edwin B. Wilson, Managing Editor
Alexander Wetmore, Home Secretary    Roger Adams, Foreign Secretary
William W. Rubey, Chairman of the National Research Council
J. W. Beams    G. W. Beadle    Gregory Breit
R. E. Cleland    E. A. Doisy    L. C. Dunn
W. W. Rubey    G. C. Evans    W. F. Libby
Harlow Shapley    C. H. Graham    J. von Neumann
F. E. Terman    J. T. Patterson    Paul Weiss





Publication Office: Mack Printing Company, Easton, Pa. 
Editorial Office and Home Office of the Academy 
2101 Constitution Avenue, Washington 25, D. C. 


Entered as Second-Class matter, June 21, 1920, at the Post-office at Easton, Pennsylvania, under the Act of August 24, 1912.
Accepted for mailing at a special rate of postage provided for in section 1103, act of October 3, 1917. Authorized June 18, 1920. 


Published Monthly Price per Annum, $7.50 










PROCEEDINGS 


OF THE 


NATIONAL ACADEMY OF SCIENCES 


Volume 39 June 15, 1953 Number 6 


SEROLOGICAL REACTIVITY OF SYNTHETIC POLYGLUCOSES 
By MICHAEL HEIDELBERGER AND ALAN C. AISENBERG 


Dept. OF MEDICINE, COLUMBIA UNIVERSITY COLLEGE OF PHYSICIANS AND SURGEONS, 
AND THE PRESBYTERIAN HOSPITAL, NEW YORK 


Communicated March 9, 1953


Because of the great complexity of natural substances with antigenic
properties, little is known of the chemical basis of their immunological
specificity. Methods for the determination of the fine structure of the
proteins, the class of substances to which many antigens belong, are still in
their infancy. Knowledge of the particularities of the structure of
carbohydrates is, however, in a more advanced state. Within this class of
substances certain members, such as the type-specific capsular polysaccharides
of pneumococci,¹ not only interact with antibodies to yield precipitates,
but may even stimulate the production of antibodies in, for example, man
and the mouse. The constituent sugars of several of the type-specific
polysaccharides are known, and the fine structure of the specific
polysaccharide of Type III pneumococcus has been worked out,² as has also
that of one of the specific carbohydrates of the tubercle bacillus.³
Moreover, the development of quantitative, absolute methods for the estimation
of antibodies⁴ provided a new tool for the study of immune reactions and
led to a quantitative theory of the reaction of specific precipitation,⁵ in
terms of which the cross-reactivities of chemically related specific
polysaccharides such as those of pneumococcus Types III and VIII could be
interpreted.⁶

The recent production of synthetic polyglucoses with possible uses as
plasma extenders furnished the stimulus for the studies which follow.*
Dextran, a series of polyglucoses built up by various strains of Leuconostoc,
is known to give precipitates in antipneumococcus sera of Types II, XII,
XX and, to a smaller extent, XXII.⁷ Severe reactions occasionally
encountered in the use of dextran in human beings have been ascribed to its
possible reactivity with preexisting antibodies.⁸ A serological study of the
synthetic polyglucoses was therefore of practical as well as theoretical
interest.





454. BIOCHEMISTRY: HEIDELBERGER AND AISENBERG Proc. N. A. S 


Experimental. A polymer of glucose, 52R611, kindly supplied by Merck
& Co., Inc., Rahway, N. J.,† was fractionated as follows: 10 g. were
added to 100 ml. of 5% sodium acetate solution. The dark mixture was
brought to pH 6.5 by addition of sodium hydroxide and allowed to stand in
the cold for two days, with occasional mixing. All subsequent operations
were also carried out in the cold. The soluble portion, at a volume of 76
ml., was mixed with 49 ml. of chilled isopropyl alcohol,⁹ which sufficed to
form a precipitate. After two days this was centrifuged off and taken up in
water. The centrifuged solution and the washings of a residual gel were
poured into several volumes of redistilled ethyl alcohol and the precipitate
was filtered off and dried in vacuo (fraction A). The original supernatant
from which A had separated was treated with 20 ml. more of isopropyl
alcohol and the precipitate (B) isolated as was A. The supernatant from B
was made up to about 225 ml. with 95 per cent ethyl alcohol and the
precipitated fraction (D) was isolated as before. Fraction A was divided roughly


TABLE 1
FRACTIONATION OF 10 G. OF GLUCOSE POLYMER 52R611

                                          REDUCING SUGAR     MAXIMUM ANTIBODY N, %, PPTD.
                                          AS GLUCOSE,            FROM ANTISERUM, TYPE
FRACTION  YIELD, G.  η rel(a)  η red(a)   UNHYDROLYZED, %     IX     XII    XX     XXII
A           0.19      1.182     3.0          4.6              35     50     50     35
A'          0.14      1.075     1.25         ...              21     35     35     25
B           1.16      1.061     1.0          4...             19     30     25     20
D           1.45      1.030     0.5          ...               3     15      6      4
E           1.8(b)    1.027     0.45         ...               0.1    5      3      0.2
(a) 0.6% solution in 0.85% NaCl solution. "rel" and "red" refer to relative and reduced viscosities, respectively.
(b) Also per 10 g. polyglucose.


in half by partial precipitation from 10 ml. of 5 per cent sodium acetate 
solution with glacial acetic acid, a method known to be useful in the separa- 
tion of serologically active from inactive polysaccharides.’ The insoluble 
fraction was designated A and the soluble portion A’. Properties of the four 
fractions are summarized in table 1, and their quantitative behavior 
toward a Type XII antipneumococcus horse serum is shown in figure 1. 
Plots of data obtained in antisera to Types IX, XX, and XXII were re- 
markably similar, and are therefore not given. 
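
As a check on the viscosity columns of table 1, the printed reduced viscosities can be reproduced from the relative viscosities with the usual definition, reduced viscosity = (relative viscosity − 1)/c. The sketch below is illustrative only. The function name is ours, the relative viscosity of fraction D is read as 1.030, and the concentration value c = 0.06 is an assumption chosen because it reproduces the printed column; the table footnote states a 0.6% solution, so the concentration unit implied by the tabulated values is not recoverable from the text.

```python
# Relative viscosities of the fractions as printed in table 1.
ETA_REL = {"A": 1.182, "A'": 1.075, "B": 1.061, "D": 1.030, "E": 1.027}

# Assumed concentration value (hypothetical unit) that reproduces the
# printed reduced viscosities; it is not stated in this form in the paper.
ASSUMED_C = 0.06


def reduced_viscosity(eta_rel, c=ASSUMED_C):
    """Reduced viscosity: specific viscosity (eta_rel - 1) divided by c."""
    return (eta_rel - 1.0) / c


if __name__ == "__main__":
    for name, eta in ETA_REL.items():
        print(f"{name:>2}: reduced viscosity = {reduced_viscosity(eta):.2f}")
```

Under that assumption the computed values round to the tabulated 3.0, 1.25, 1.0, 0.5, and 0.45.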

In an attempt to find a fraction without serological reactivity 20 g. of 
polymer 52R611 were treated as above for the separation of fraction A. 
Material corresponding to fraction B and part of D was then precipitated 
in the cold by bringing the isopropyl alcohol concentration to 51 per cent by 
volume instead of to 47.5 per cent, as in the first instance. All residual 
polymer precipitable by alcohol (E) was then thrown down by bringing the 
volume to 1 liter with 95 per cent ethanol in the cold. The yield was 3.6 g. 





TABLE 2
CROSS REACTIONS OF SYNTHETIC POLYGLUCOSES AND DEXTRANS IN ANTIPNEUMOCOCCUS HORSE SERA
[Table body not recoverable from the scan. Legible footnotes: values in parentheses indicate readings after centrifugation; reduced viscosities and a number-average molecular weight are given for individual samples.]







As noted in tables 1 and 3 and figure 1, fraction E gave considerably less
precipitate with antisera than did the others.
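
The isopropyl alcohol concentrations quoted for the fractionation can be checked from the volumes given in the Experimental section: 49 ml., and then 20 ml. more, added to 76 ml. of solution. The following is a minimal sketch, assuming that volumes are simply additive (alcohol-water mixtures contract slightly, so this is only approximate). The function names are ours, and the last line, which applies the 51 per cent figure to the same 76 ml. volume, is purely hypothetical, since the volume used in the second run is not stated.

```python
def alcohol_fraction(aqueous_ml, alcohol_ml):
    """Volume fraction of alcohol, assuming additive volumes."""
    return alcohol_ml / (aqueous_ml + alcohol_ml)


def alcohol_needed(aqueous_ml, target_fraction):
    """Alcohol volume needed to reach a target volume fraction."""
    return target_fraction * aqueous_ml / (1.0 - target_fraction)


if __name__ == "__main__":
    first = alcohol_fraction(76, 49)        # precipitation of fraction A
    second = alcohol_fraction(76, 49 + 20)  # precipitation of fraction B
    print(f"after 49 ml.: {100 * first:.1f} per cent by volume")
    print(f"after 69 ml.: {100 * second:.1f} per cent by volume")
    # Hypothetical: alcohol required for 51 per cent from the same 76 ml.
    print(f"for 51 per cent: {alcohol_needed(76, 0.51):.1f} ml.")
```

The second figure comes to about 47.6 per cent, in agreement with the 47.5 per cent stated for the first fractionation.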

In table 2 are shown qualitative tests of other samples from Merck and
duPont,‡ some of them fractionated, as 52R2108, 9, others partially
hydrolyzed and/or prepared with different catalysts. Table 3 contains data
on the total antibody (antibody to the group-specific "C" polysaccharide as
well as type-specific antibody) precipitated by many of the fractions.
Antibody determinations were carried out mainly at 0°C. according to

TABLE 3
QUANTITATIVE DATA ON HOMOLOGOUS AND CROSS-REACTING ANTIBODIES IN VARIOUS
ANTIPNEUMOCOCCUS HORSE SERA
[Table body not recoverable from the scan. Legible column headings: antiserum, type; antibody N in unabsorbed serum and in C-absorbed serum, µg./ml.; antibody N pptd. by fractions A, B, D, and E of 52R611 from unabsorbed serum and, for fraction A, from C-absorbed serum, µg./ml.; antibody N pptd. by 52R2109 and by 52R2108.]
a 850 with specific polysaccharide of pneumococcus Type VII (S VII) treated with HNO₂ to remove C-substance. Supernatant gave 365 µg. with C; this supernatant gave 7 µg. N with A.
b 1764 with specific polysaccharide of pneumococcus Type XII (S XII) treated with HNO₂ to remove C-substance. Supernatant gave 149 µg. with C; this supernatant gave no ppt. with A.
c Fraction A' gave values slightly higher than those with Fraction B.
d The supernatant gave 1143 µg. antibody N with S IX; total, 1713.
e The supernatant gave 950 µg. antibody N with S XII. 20 mg. Fraction D per ml. antiserum, in the antigen-excess zone of complete inhibition, gave rise to 85% inhibition of precipitation by 2 mg. Fraction A; precipitation by S XII was not affected.
f 776 µg. N were precipitated from the C-absorbed serum by Fraction A.


reference 11 or, with smaller quantities, with estimation of N by the
Markham method.¹² A few analyses were also run at 37°C. for comparison
(Fig. 1), since cross reactions are normally more influenced by differences in
temperature than are the homologous reactions.⁶

Discussion. It is evident that the synthetic polyglucoses give precipi- 
tates in some, but not all, of a wide range of antipneumococcus horse sera. 
That it is actually the antibody in these sera which is precipitated is shown 
in several ways: first, by the smaller amount of nitrogen thrown down 







after removal of antibodies to the group-specific "C" substance of
pneumococcus, and second, by more detailed analyses, mainly with Type XII
serum. These showed that after removal of the antibodies with the specific
polysaccharide of Type XII pneumococcus and with pneumococcal "C"
substance the residual serum gave only negligible precipitation with the
otherwise maximally reactive polyglucose fraction A. Also, in the
antisera to several types, the amount of antibody directly precipitable by
the pneumococcal polysaccharides usually agreed well with the sum of the
quantities precipitated first by a polyglucose and then by the
pneumococcal carbohydrates (cf. footnotes to table 3).





FIGURE 1
[Plot not recoverable from the scan. Abscissa: antigen added.]





Reactions of polyglucose fractions A, A’, B, D, E with antipneumococcus 
Type XII Horse Serum, No. 625 


The reducing values of the fractions given in table 1 indicate a similar
proportion of such end groups, although the viscosities point to large
differences in size or structure, or both. In view of the rather random
distribution of mainly 1,4- and 1,6-linkages found by Pacsu and his coworkers¹³
in unfractionated material with molecular weight averaging 180,000 (by the
method of light scattering) it is possible, though not certain, that the
products of highest viscosity represent those of largest molecular size. In
general, it is these which precipitate the most antibody from the various
sera. Qualitatively this applies as well to the three dextran samples, of
which the least reactive is the partially hydrolyzed clinical sample, N150N. 
t 


The polyglucose sample, P218-56,‡ is comparable with this dextran in
average molecular size, but shows considerably lower serological activity
except in Type XII antiserum.

According to the quantitative theory of the precipitin reaction which
describes the mutual precipitation of antigen and antibody in terms of the
multivalence of each reactant⁶ one would expect a cross reaction to be more
complete with increasing size of the antigen molecules. Larger numbers of
reactive groupings should overcome hindrances to precipitation due to the
less perfect fit of antigen and antibody than in the homologous reaction.
However, Kabat and Berg⁸ have shown that partially hydrolyzed clinical
dextrans do not necessarily precipitate less antidextran from the sera of
human beings than do undegraded samples. Further chemical data on
both the polyglucose fractions and the various dextrans are therefore
needed before the serological behavior of these substances can be fully
explained.

Although the viscosities and reducing values of fractions D and E are 
almost identical their serological reactivities are not the same. Whether 
or not there are differences in the average sizes of the molecules in the two 
fractions, the presence of separable fractions with structural differences 
appears indicated. 

The chemical basis for the quite extensive cross reactivity of some of the
synthetic polyglucoses with antibodies specific for pneumococcal "C"
polysaccharide and the type-specific polysaccharides of pneumococcal
Types IX, XII, XX, and XXII is as yet uncertain. Since even the
component sugars of these five bacterial carbohydrates are unknown it would
seem premature to theorize until current attempts to supply at least a
portion of the necessary knowledge are further advanced. Stacey has, however,
pointed out that the specific polysaccharide of Type II pneumococcus and
one of the dextrans contain branching points of glucose units with linkages
in the 1, 4, and 6 positions.¹⁴ If the reactions of the polyglucoses described
above were due to the multiple recurrence of the same linkages one would
expect the glycogens and amylopectins to give precipitates in antisera to
the same types. The positive outcome of experiments to test this surmise
will be described in a separate communication.

Summary.—Cross reactions of synthetic polyglucoses with antibodies to 
pneumococcal C-substance and type-specific polysaccharides are recorded 
and compared with similar reactions of several dextrans. A partial inter- 
pretation of the results is given. 

* Carried out with the aid of the Harkness Research Fund and a grant from the 
Rockefeller Foundation. Generous gifts of antisera by the Bureau of Laboratories, 
New York City Department of Health, and the Division of Laboratories and Research 
of the Department of Health, State of New York, are also gratefully acknowledged. 

† The writers are greatly indebted to Merck & Co., Inc., for these and other samples
and data, and especially to Dr. Arthur J. Zambito of Merck & Co., Inc., for his many
courtesies.





VoL. 39, 1953 BIOCHEMISTRY: SCHMITT ET AL 459 


‡ Acknowledgment is due E. I. du Pont de Nemours & Co., Inc., for this sample and
relevant data.

¹ Heidelberger, M., and Avery, O. T., J. Exptl. Med., 38, 73 (1923); 40, 301 (1924).

² Hotchkiss, R. D., and Goebel, W. F., J. Biol. Chem., 121, 195 (1937); Reeves, R. E.,
and Goebel, W. F., Ibid., 139, 511 (1941).

³ Haworth, N., Kent, P. W., and Stacey, M., J. Chem. Soc., 1948, 1220; the
polysaccharide described Ibid., 1211, cannot contain amino sugar, since it has been obtained
nitrogen-free. Heidelberger, M., and Menzel, A. E. O., J. Biol. Chem., 118, 79 (1937).

⁴ Reviewed in Heidelberger, M., Bact. Rev., 3, 49 (1939); Chem. Rev., 24, 323 (1939).

⁵ Heidelberger, M., and Kendall, F. E., J. Exptl. Med., 61, 563 (1935).

⁶ Heidelberger, M., Kabat, E. A., and Shrivastava, D. L., Ibid., 65, 487 (1937);
Heidelberger, M., Kabat, E. A., and Mayer, M., Ibid., 75, 35 (1942).

⁷ Sugg, J. Y., and Hehre, E. J., J. Immunol., 43, 119 (1942); cf. also Zozaya, J.,
J. Exptl. Med., 55, 353 (1932).

⁸ Kabat, E. A., and Berg, D., J. Immunol., May, 1953. In this paper it is shown,
among other things, that two synthetic polyglucoses partially precipitate antibodies
formed in man after injection of dextran. In part, also, in Ann. N. Y. Acad. Sci., 55,
471 (1952).

⁹ Heidelberger, M., MacLeod, C. M., Markowitz, H., and Roe, A. S., J. Exptl. Med.,
91, 341 (1950).

¹⁰ Heidelberger, M., and Kendall, F. E., Ibid., 53, 625 (1931).

¹¹ Heidelberger, M., and Kendall, F. E., Ibid., 61, 559 (1935).

¹² Markham, R., Biochem. J., 36, 790 (1943); cf. also Kabat, E. A., and Mayer, M.
M., Experimental Immunochemistry, C. C Thomas, Springfield, Ill., 1948.

¹³ Pacsu, E., personal communication.

¹⁴ Stacey, M., Biochem. J., 53: Proc. Biochem. Soc., xiii (1953); Endeavour, 12, 38
(1953).


A NEW PARTICLE TYPE IN CERTAIN CONNECTIVE TISSUE
EXTRACTS*

By FRANCIS O. SCHMITT, JEROME GROSS, AND JOHN H. HIGHBERGER

THE DEPARTMENT OF BIOLOGY, MASSACHUSETTS INSTITUTE OF TECHNOLOGY, CAMBRIDGE;
THE DEPARTMENT OF MEDICINE, MASSACHUSETTS GENERAL HOSPITAL
AND HARVARD MEDICAL SCHOOL, BOSTON; AND THE UNITED SHOE MACHINERY
CORPORATION, BEVERLY, MASSACHUSETTS

Communicated April 30, 1953

Chemical and electron microscopic studies of the in vitro reconstitution
of fibrils from acid solutions of collagen have revealed that the form of the
axial repeating structure in the fibril is markedly influenced by the
physical-chemical environment. This paper describes a new structure precipitated
from collagen solutions, the organization of which is regarded as having
fundamental significance for collagen structure.

Early studies! ° patterned after the classic Nageotte experiment® dem- 
onstrated that the addition of NaCl to dilute acid solutions of certain forms 







of collagen, such as rat tail tendon or ichthyocol,⁷ precipitates fibrils which,
when viewed in the electron microscope, exhibit a very regular axial
repeating period. Neutralization or addition of 0.1-0.2 M solutions of
salts with monovalent cations precipitated fibrils with the axial period and
detailed intraperiod fine structure characteristic of native collagen fibrils.
At somewhat higher ionic strength the precipitated striated fibrils had
periods of about one-third the normal, or 220 A. At still higher salt
concentration the fibrils formed were structureless. * *


FIGURE 1
[Histogram not recoverable from the scan. Panel title: segment long-spacings, phosphate extract of calf corium. Mean = 2000 A (301 segments). Abscissa: segment length, A.]


Frequency distribution curve of lengths of segments produced from phosphate extracts 
of calf corium. 


Following the procedure for preparing "procollagen" described by
Tustanovsky⁹ and Orekhovich, et al.,¹⁰ whereby skin and other tissues are
extracted with acid citrate buffer and the extract dialyzed against water, the
present authors found that the precipitate which forms on dialysis contains,
besides typical collagen fibrils, a new fibril type.¹¹ This fibril, having an
axial period ranging from 1800 to 3000 A, was called the "long-spacing"
or "LS" form. To distinguish between this structure and the new
structure to be described below, we shall refer to it as "fibrous long-spacing"
or "FLS."

FLS could be produced also from acetic acid filtrates of fish swim bladder
collagen (ichthyocol) by the addition of plasma α-1 acid glycoprotein
followed by dialysis.» The acid filtrates contain about 0.1% collagen. At
glycoprotein concentrations of about 0.02% or higher, FLS are formed
almost exclusively. At lower concentrations (ca. 0.001%) of glycoprotein,
chiefly collagen-type fibrils are reconstituted. At intermediate
concentrations both types are formed. FLS can be converted into the collagen
type by dissolving in 0.05% acetic acid and dialyzing against 1.0% NaCl.

FIGURE 2
[Histogram not recoverable from the scan. Panel title: segment long-spacings from a phosphate extract of ichthyocol collagen. Mean = 1760 A (315 segments).]

Frequency distribution curve of lengths of segments obtained from phosphate
extracts of carp swim bladder tunic.

Although glycoprotein is highly efficient in producing FLS and collagen, 
it is not unique in this property; a number of other substances will accom- 
plish the same result. It was therefore concluded'’ that the components 
necessary for the formation of cross-striated fibrils, whether of the collagen 
or FLS type, are contained in the collagen extracted from the connective 
tissue. The precipitating substances seem to be relatively non-specific 
under the conditions of the experiment." 







Detailed intraperiod structure may be observed in LS fibrils. At least 
14 bands of characteristic position and density have been observed in prep- 
arations stained with phosphotungstic acid (PTA). The pattern of 
banding (Fig. 5) is strikingly different from that of collagen (Fig. 6) not 
only in the relative positions of the bands but also in being non-polarized or 
symmetrical in the axial direction. 

It is generally assumed that collagen is composed of protofibrils or
polypeptide chains with sequences of amino acids running in the same direction
in adjacent chains and with discontinuities in register, thus forming a
cross-striated fibril with polarized structure. It is possible that the symmetry of
FLS structure is produced by a packing of polarized chains oriented in both
directions (antiparallel array).

FIGURE 3
[Histogram not recoverable from the scan. Panel title: segment long-spacings, ichthyocol collagen plus ATP, suspended in dilute acetic acid (1,123 segments).]

Frequency distribution curve of lengths of segments produced by the interaction of
ATP and ichthyocol filtrate.

Using 70,000 as the molecular weight of procollagen and 600 the number
of amino acid residues, Bresler, et al.,'* concluded that procollagen must
be a highly coiled chain since they deduced that the length of the molecule
is only 380 A. If the helix were uncoiled the molecule would have a length
of the order of 2000 A. The suggestion has been made” that in LS fibrils
the chains are uncoiled, being essentially in a β state. For this there is as
yet no direct evidence. Although high-angle x-ray patterns of FLS
preparations displayed the features characteristic of normal collagen, more
conclusive evidence is required to prove or refute this point of view.
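
The chain-length argument attributed to Bresler et al. can be followed with a few lines of arithmetic. The sketch below uses only the figures quoted in the text (molecular weight 70,000, 600 residues, 380 A coiled, about 2000 A uncoiled); the variable and function names are ours. The remark that a rise of roughly 3.3 A per residue is of the order expected for a nearly fully extended polypeptide chain is our gloss, not a statement from the paper.

```python
MOLECULAR_WEIGHT = 70_000   # procollagen, as quoted in the text
RESIDUES = 600              # amino acid residues, as quoted
COILED_LENGTH_A = 380       # deduced length of the coiled molecule, A
UNCOILED_LENGTH_A = 2000    # order of magnitude if uncoiled, A


def per_residue(total, residues=RESIDUES):
    """Divide a whole-molecule quantity evenly over the residues."""
    return total / residues


if __name__ == "__main__":
    print(f"mean residue weight: {per_residue(MOLECULAR_WEIGHT):.1f}")
    print(f"coiled:   {per_residue(COILED_LENGTH_A):.2f} A per residue")
    print(f"uncoiled: {per_residue(UNCOILED_LENGTH_A):.2f} A per residue")
    ratio = UNCOILED_LENGTH_A / COILED_LENGTH_A
    print(f"uncoiled/coiled length ratio: {ratio:.1f}")
```

On these figures the uncoiled chain is about five times the coiled length, which is why a segment or period near 2000 A is read as a single extended molecule.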
Substantially all of the collagen in acid solutions of ichthyocol may be
converted to FLS if the concentration of acid glycoprotein, added before
dialysis, is sufficiently high. This suggests the possibility that thin
particles having the length of the FLS period, i.e., about 2000 A, may occur in
the collagen solutions or may be dissociated from a higher polymer. The
fact that segments of the order of 2000 A in length were occasionally
observed in "procollagen" solutions (Fig. 7) lends support to this view.
Accordingly, efforts were made to isolate the material of which such elongate
particles are composed.

FIGURE 4
Ultraviolet absorption spectra of: 1. Phosphate extract of calf corium;
2. Phosphate extract of carp swim bladder tunic.

Earlier experiments® indicated that FLS could be produced from mildly 







alkaline phosphate extracts of skin and other connective tissues. Dialysis
of such extracts produces an amorphous precipitate; extraction of this
precipitate with citrate buffer (pH = 3.8, µ = 0.2), followed by dialysis,
leads to the formation of FLS together with collagen-type fibrils.

A slight variation of this procedure led to the discovery of an entirely
different and new long-spacing type. This involved the dialysis of a
phosphate extract (pH = 8, µ = 0.4) against citrate buffer (pH = 4, µ = 0.2)
rather than the citrate extraction of the dialyzed precipitate of a phosphate
extract. Shortly after the beginning of dialysis a copious white precipitate
begins to form. In the electron microscope this material was seen to
contain (besides variable amounts of unstructured material and thin filaments)
a new component in the form of segments having a characteristic pattern
of internal band structure (Fig. 8).

In figures 1 and 2 are shown the distribution curves of lengths of
segments obtained from phosphate extracts of calf corium and carp swim
bladder tunic. The curves differ with regard to the mean length and dis- 
persion. The reasons for the differences are not yet clear; however, they 
serve at least to establish an order of magnitude for the segment length. 
The segment lengths clearly fall within the range of periodicity of the fibrous 
long spacings. These new structures will be called “segment long spac- 
ings” or “SLS.” 

Further characterization of the material precipitated as SLS required 
considerable purification. It proved very difficult to separate SLS from 
contaminants in skin extracts, either before or after precipitation. Far 
more satisfactory as starting material is the tunic of the carp swim bladder
after preliminary extraction with dilute acetic acid or acid citrate buffer.
In such preparations the finely filamentous contaminant is present only in
very small amounts. An electron micrograph of purified SLS produced
from a phosphate extract is shown in figure 9.

FIGURES 5-10

Fig. 5. Long spacing fibril (FLS) produced by dialysis of a mixture of α-1 acid
glycoprotein and ichthyocol filtrate. Note the detailed symmetrical intraperiod fine
structure. PTA stained. Mag. 44,500X.

Fig. 6. Collagen fibril of cow hide showing characteristic polarized intraperiod
banding. Stained with PTA. Mag. 150,000X.

Fig. 7. Segments of LS produced early in the dialysis of a citrate extract of newborn
rat skin against water. Note similarity of intraperiod fine structure with that of FLS
(Fig. 1). Mag. 24,000X.

Figs. 8 and 8A. Segment long spacings obtained by dialysis of a phosphate extract of
calf corium against citrate. Mag. 23,000X. Note detailed asymmetric fine structure
particularly in the enlarged pair of segments joined end to end in 8A. Mag. 44,000X.

Figs. 9 and 9A. Segments obtained from phosphate extracts of carp swim bladder
tunic which had been previously extracted with dilute acetic acid. PTA stained and
lightly shadowed with chromium. Mag. 18,000X. More detailed fine structure is
illustrated in 9A (PTA alone). Mag. 80,000X.

Fig. 10. Segments produced by the addition of ATP to an ichthyocol filtrate. PTA
stained and lightly shadowed with chromium. Mag. 24,000X. 10A is a higher
magnification. Mag. 49,000X.

It will be noted that the SLS structure is polarized. This is most easily
noticed by observing the interband region of very low density which
occurs slightly off-center (Figs. 8A and 9A). As many as 18 bands have been
observed in individual segments.

Phosphate extracts both of skin and of acid extracted swim bladder 
tunic manifest strong absorption at about 2600 A and a minimum at about 
2375 and 2450 A (Fig. 4) suggesting the possibility that nucleic acid or 
nucleotides may play a role in the formation of SLS. It is true that certain 
solutions such as phosphate extracts of salt-reconstituted ichthyocol, which 
yield SLS upon dialysis against citrate buffer, show no absorption maximum 
at 2600 A. Nevertheless it seemed worth while to examine the efficacy of 
nucleic acid and various nucleotides in forming SLS from collagen solutions. 

Yeast ribonucleic acid and DNA dissolved in 0.05 M citrate, pH = 4, and 
also in water, were added to ichthyocol filtrate to make concentrations 
ranging from 0.005 to 0.6%. Immediate precipitation of structureless 
filaments, but no SLS, occurred in all cases.

Adenosine-3-phosphoric acid, adenosine-5-phosphoric acid, uridylic,
and guanylic acids in 0.2% concentration produced no precipitate.
Hexametaphosphoric acid, sodium pyrophosphate, and the sodium salts of
ADP and adenosine triphosphate produced flocculent precipitates
containing collagen-type or structureless fibrils immediately after mixing.
No SLS or FLS were formed.

However, when adenosine triphosphoric acid (ATP), dissolved in 0.05%
acetic acid, was added in 0.1 to 0.25% final concentration to ichthyocol
filtrate, precipitation started almost immediately and continued for several
hours in the cold. These precipitates are composed exclusively of SLS
although some very finely filamentous material is present. Their structure
(Figs. 10 and 10A) is essentially identical with that of SLS obtained from
phosphate extracts of skin and swim bladder. Figure 3 is a distribution
curve of segment length in such a preparation of ATP-induced SLS.
There is considerable variability in segment width from one preparation to
the next and ofttimes within the same preparation as shown in figures 8
and 12. Frequently one observes segments standing on end, as in figure
13, and it is obvious that these structures are not flat ribbons but have
significant thickness. This is also evident from the shadows cast by those
segments lying flat. Preliminary study indicates that SLS is very highly
hydrated. Application of the freeze drying technique of Williams'® may
provide interesting information on the shape of these structures.

FIGURES 11-14

Fig. 11. ATP-produced segments washed in dilute acetic acid. Fraying indicates
the parallel array of filamentous components composing the segment. Mag. 42,000X.

Fig. 12. ATP-produced segments revealing great variability in width and also
emphasizing the fibrous nature of the components. Mag. 21,000X.

Fig. 13. ATP-induced SLS showing several standing on end. Calculation from
shadow length indicates a height nearly equal to the length of those segments lying flat.
Their three dimensional quality is also apparent. Mag. 12,000X.

Fig. 14. SLS transformed to collagen after dissolving in NaCl and dialyzing against
the same salt. A small amount of amorphous debris is usually present. Mag. 27,000X.

Washing the precipitate several times with fresh 0.1% ATP in 0.05%
acetic acid further removed the earlier described contaminant and
provided a highly purified preparation of SLS. When such purified SLS is
exposed to acetic acid in the same concentration in which they were formed
(0.05%, pH 4.9) the segments begin to disintegrate. This process is in
itself of considerable interest for, when followed with the electron
microscope, it lends strong support to the view that the segments are composed
of a parallel packing of very thin fibrous particles of uniform length (Fig.
11). It may also be seen that the banded structure is still evident in some
of the thin segment frays.

For present purposes it seems reasonable to assume that the chains in
the SLS are in parallel array and that ATP molecules tend to bond
neighboring chains laterally. More evidence is needed before a definitive
suggestion can be made regarding the specific nature of this bonding.
Preliminary estimates indicate that SLS preparations which had been washed
with acid may contain as much as 12% of ATP which would correspond to
about one mole of ATP per 3278 grams of protein.


Particularly pertinent is the question of the relationship between
collagen, fibrous LS, and segment LS. Can they be converted one into the
other and, if so, by what means? Experiments outlined below show that
all of these forms are interconvertible.

Conversion of SLS to Collagen. When a pure preparation of SLS is
dissolved in 0.17 M NaCl and dialyzed at room temperature against NaCl,
a heavy fibrous precipitate forms which is composed entirely of well- 
structured collagen fibrils with the characteristic collagen banding and a 
small amount of granular debris (Fig. 14). The conversion to collagen is 
accelerated at temperatures up to 37°C. 

Conversion of Collagen to SLS. Collagen-type fibrils with characteristic 
banding may be obtained by dialyzing an acid filtrate of ichthyocol against 
a 1% NaCl solution. This precipitate of collagen fibrils, after removal by
centrifugation, was dialyzed exhaustively in the cold against 0.05% acetic 
acid. The precipitate dissolved to form a tenuous gel. This was diluted 
to the original volume with 0.05% acetic acid and ATP added to make a
concentration of 0.2%. The precipitate which formed was nearly pure
SLS. 

Conversion of Fibrous LS to Segment LS. LS fibrils, produced by dialysis 
of a mixture of ichthyocol filtrate and acid glycoprotein (procollagen), may 





be dissolved in acetic acid. Dialysis of such a solution against 1% NaCl
produced collagen fibrils while addition of ATP produced SLS. 

These results demonstrate that the various forms of fibrous “collagens”
may be freely converted, one into the other. The likelihood that thin
fibrous protein particles about 2200 Å long, or polymers thereof, exist in
various extracts of connective tissue is indicated both by the action of
glycoprotein, to form the symmetrical fibrous LS, and by the action of
ATP, to form the polarized, asymmetric segment type of LS.

Preliminary chromatographic analyses reveal that the amino acid pat- 
tern of SLS protein resembles that of collagen fairly closely. Tyrosine 
values are low and this amino acid may actually be absent. Hydroxy- 


proline is present in ichthyocol SLS in a concentration approximately 90% 


of that found in ichthyocol collagen. After allowance is made for the ATP 
nitrogen, the nitrogen content of the SLS protein was estimated to be 
18.20%, in good agreement with that of collagen. Films of SLS give a 
high-angle x-ray pattern characteristic of collagen. 

Neither fibrous nor segmental LS have thus far been observed in tissues. 
This may be due to the relatively high ionic strength of tissue fluids and to 
the fact that the long particles may not exist in the uncombined state to 
any appreciable degree either in the ground substance or in the formed fi- 
brils of the connective tissue. This problem is being investigated. 

The possible range of conditions under which SLS may form has not yet 
been determined. It would be premature to implicate ATP itself 
or other nucleotides specifically in their formation from native tissue ex- 
tracts. The presence of purine- or pyrimidine-containing compounds in 
the phosphate extracts may conceivably be coincidental. It is quite pos- 
sible that the long protein particles may be capable themselves of aggre- 
gating in the manner characteristic of SLS when the physical chemical 
environment is appropriate. The possible relationship of these particles 
to collagen precursors is at present a matter for speculation. 

Elongate particles or molecules having lengths of the order of 1500 to
3000 Å have been deduced to exist in tropomyosin,¹⁷ L-myosin,¹⁸ paramy-
osin,¹⁹ and keratin.²⁰ It is possible that application of the type of technique
described above may permit direct electron microscopic visualization of the 
elongate particles through their reaction with a compound like ATP which 
may unite chains of similar length and constitution laterally and form 
either segmental or fibrous LS structures, as in the case of collagen. 


Acknowledgment.—The authors wish to thank Mrs. Zelda Sokal, Mr. R. 
A. Whitmore, Mr. L. S. Allen, and Mr. J. W. Jacques for their technical 
assistance. 


* This investigation was supported in part by grants from the Eli Lilly Company to
the Massachusetts Institute of Technology and from the U. S. Public Health Service
to the Massachusetts General Hospital.

470 CHEMISTRY: J. DONOHUE Proc. N. A. S.

† This is publication No. 141 of the Robert W. Lovett Memorial Foundation for the
Study of Crippling Disease, Department of Medicine, Massachusetts General Hospital 
and Harvard Medical School. 

¹ Schmitt, F. O., Hall, C. E., and Jakus, M. A., J. Cell. Comp. Physiol., 20, 11-33
(1942).

² Bahr, G., Exptl. Cell Research, 1, 603-606 (1950).

³ Vanamee, P., and Porter, K. R., J. Exptl. Med., 94, 255-268 (1951).

⁴ Noda, H., and Wyckoff, R. W. G., Biochim. et Biophys. Acta, 7, 494-506 (1951).

⁵ Gross, J., Proc. Josiah Macy Jr. Foundation; Conference on Metabolic Inter-
relationships, 4, 32-57 (1952).

⁶ Nageotte, J., and Guyon, L., Arch. Biol., 41, 1-385 (1930).

⁷ Fauré-Fremiet and Garrault, H., Arch. Anat. mic., 33, 81-94 (1937).

⁸ Highberger, J. H., Gross, J., and Schmitt, F. O., Proc. Natl. Acad. Sci., 37, 286-
291 (1951).

⁹ Tustanovsky, A. A., Biokhimiya, 12, 285-290 (1947).

¹⁰ Orekhovich, V. N., Tustanovsky, A. A., Orekhovich, K. D., and Plotnikova, N. E.,
Biokhimiya, 13, 55-60 (1948).

¹¹ Highberger, J. H., Gross, J., and Schmitt, F. O., J. Am. Chem. Soc., 72, 3321
(1950).

¹² Gross, J., Highberger, J. H., and Schmitt, F. O., Proc. Soc. Exptl. Biol. Med., 80,
462-465 (1952).

¹³ It should be noted, however, that a wide variety of proteins and other high polymers
will not reproduce this phenomenon.

¹⁴ Bresler, C. E., Finogenov, P. A., and Frenkel, S. Y., Repts. Acad. Sci. U.S.S.R.,

¹⁵ Bear, R. S., Adv. Protein Chem., 7, 69-160 (1952).

¹⁶ Williams, R. C., Biochim. et Biophys. Acta, 9, 237-239 (1952).

¹⁷ Bailey, K., “The Muscle. A Study in Biology and Pathology,” L’Expansion Sci.
Francaise, 31-46 (1950).

¹⁸ Weber, H. H., Adv. Protein Chem., 7, 161-252 (1952).

¹⁹ Hodge, A. J., Proc. Natl. Acad. Sci., 38, 850-855 (1952).

²⁰ Mercer, E. H., and Olofsson, B., J. Polymer Sci., 6, 671-680 (1951).


HYDROGEN BONDED HELICAL CONFIGURATIONS OF THE
POLYPEPTIDE CHAIN 
By JERRY DONOHUE 


THE MEDICAL RESEARCH COUNCIL UNIT FOR THE STUDY OF THE MOLECULAR STRUCTURE
OF BIOLOGICAL SYSTEMS, CAVENDISH LABORATORY, CAMBRIDGE, ENGLAND 


Communicated by Linus Pauling, April 13, 1953 


Pauling and Corey¹⁻³ have formulated two helical configurations for
polypeptide chains in which: (1) the residues have the dimensions deduced
by consideration of the results of crystal structure studies in amino acids
and related compounds,⁴ (2) all residues are equivalent, and (3) each
residue forms an N-H···O hydrogen bond of length about 2.75 Å in





which the nitrogen-oxygen vector is not more than 30° from the nitrogen-
hydrogen vector. An important feature of the residues is the planarity
of the atoms of the peptide group (the C′=O and N-H groups together with
the two adjoining α-carbon atoms) in each. The two helices were termed by
Pauling and Corey the α-helix and the γ-helix. A convenient system for
assigning an unambiguous designation to helical polypeptide structures
with equivalent residues is that of Bragg, Kendrew and Perutz.⁵ In
this system each helix is designated by two numbers, S_R, where S denotes
the screw symmetry and R the number of atoms in the hydrogen bonded
ring. The numbers R are of two families, being equal to 3n + 4 for rings
of the type -C(O)-(NH-CHR-CO)ₙ-N(H)-, and 3n + 5 for rings of the
type -C(O)-(CHR-NH-CO)ₙ-CHR-N(H)-, where the carbonyl oxygen and the
amide hydrogen close the ring through the hydrogen bond. The α-helix is
then 3.6₁₃, and the γ-helix is 5.1₁₇. For a helix of given R, the conditions
postulated by Pauling and Corey in some cases render the construction of a
stereochemically satisfactory helix impossible, otherwise they determine S
within rather narrow limits.

FIGURE 1. The 2.2₇ helix.    FIGURE 2. The 3.0₁₀ helix.
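The two families of ring sizes can be listed explicitly. The following is a minimal sketch, assuming that n starts at 1 and that only rings with R not greater than 17 are of interest; the function name `ring_sizes` is introduced here for illustration only.

```python
def ring_sizes(max_atoms=17):
    """List hydrogen-bonded ring sizes R <= max_atoms for both families.

    Assumption: n counts the bracketed residue units and starts at 1.
    """
    families = {"3n+4": [], "3n+5": []}
    n = 1
    while 3 * n + 4 <= max_atoms:
        families["3n+4"].append(3 * n + 4)
        if 3 * n + 5 <= max_atoms:
            families["3n+5"].append(3 * n + 5)
        n += 1
    return families


sizes = ring_sizes()
for family, values in sizes.items():
    print(family, values)
```

Under these assumptions the 3n + 4 family contains the 13-atom ring of the α-helix and the 3n + 5 family contains the 17-atom ring of the γ-helix.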





Pauling and Corey stated that the α and γ helices were the only solutions
to the problem which conformed with their basic postulates. Upon
systematic examination of all structures with R ≤ 17 it has been found,
however, that there are four additional helical configurations of poly-
peptide chains which also conform to these postulates, to the degree
described below. One of these, having designation 2.2₇ and an axial
translation of 2.75 Å per residue, resembles the “α_II” structure first sug-
gested by Huggins.⁶ This is a structure which has been the subject of
discussion in a number of papers since then, not all of which need be
referred to here. The relation of the present version with some of those
put forward previously is discussed below.

TABLE 1
ATOMIC COORDINATES FOR THE 2.2₇, 3.0₁₀, AND 4.3₁₄ HELICES
(Columns x, y, z for the atoms C, N, H, C′, O, βC-1, and βC-2 of each helix;
the numerical entries are illegible in this copy.)

The existence of a second, π-helix, has been pointed out by Low and
Baybutt;⁷ this helix has the designation⁸ 4.4₁₆ and an axial translation of
1.15 Å per residue. The two other structures have designations 3.0₁₀
and 4.3₁₄, and axial translations of 2.00 Å and 1.20 Å per residue respec-
tively. Atomic coordinates for the 2.2₇, 3.0₁₀, and 4.3₁₄ helices are presented
in table 1. Drawings for the four structures are shown in figures 1, 2,
3, and 4. (The coordinates used in preparing the drawing for the 4.4₁₆
helix were determined independently in this work.)
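The designation and the axial translation together fix the pitch of each helix and the rotation per residue. The sketch below assumes that S may be read as the number of residues per turn; the dictionary `HELICES` and both function names are illustrative, and the numerical values are those quoted in the text for the four additional helices.

```python
# (residues per turn S, axial translation per residue in angstroms)
HELICES = {
    "2.2_7": (2.2, 2.75),
    "3.0_10": (3.0, 2.00),
    "4.4_16": (4.4, 1.15),
    "4.3_14": (4.3, 1.20),
}


def pitch(residues_per_turn, rise_per_residue):
    """Axial length of one full turn, in angstroms."""
    return residues_per_turn * rise_per_residue


def rotation_per_residue(residues_per_turn):
    """Rotation about the helix axis per residue, in degrees."""
    return 360.0 / residues_per_turn


for name, (s, rise) in HELICES.items():
    print(name, round(pitch(s, rise), 2), round(rotation_per_residue(s), 1))
```

For the 3.0₁₀ helix this gives an axial length of 6.0 Å for three residues, the figure quoted later in the text for the present structure.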

Studies were not made of helices with R greater than 17, because an
unfavorable situation with regard to van der Waals attraction is expected
for them. The two lower members of the R = 3n + 5 family may be re-
jected because of the great strain energies associated with even their most
favorable formulation. For R = 8 and planar residues, the shortest
possible N···O distance is 4.2 Å. This can be reduced to an acceptable
value by rotation of each residue about the C′-N bond by about 45°,
thus introducing a strain energy of 15 kcal. per mole per residue. This
amount could be reduced somewhat by decreasing some of the bond angles,
but only at the expense of additional strain due to bond deformation. The
structure with R = 11 can be constructed only by allowing, first, a rotation
through an angle δ of the peptide groups about the C′-N bonds, and
second, a decrease of the angle C′-C-N from 110°. While preserving
a satisfactory hydrogen bond system it is found that minimum strain
energy occurs at about δ = 20°, angle C′-C-N = 97°. These values
correspond to total strain energy of more than 5 kcal. per mole per residue,
so great as to render this helix unacceptable also. A 3.6₁₁ helix has been
described recently by Huggins,¹⁰ and atomic coordinates for it have been
published.¹¹ This helix involves great deviation from the structural
postulates; Pauling and Corey¹² have discussed the amount of the strain
energy.

FIGURE 3. The 4.4₁₆ helix.    FIGURE 4. The 4.3₁₄ helix.

Of the numerous papers in which the α_II ribbon (2.2₇ helix) has been
discussed, only one, that of Bragg, Kendrew and Perutz,⁵ gives coordinates
for the atoms. It is therefore impossible to test the other models for their
conformity with known structural principles, since sketches or photographs
of models are not sufficient. Moreover, it is noteworthy that all former
models possess an exact twofold screw axis, whereas with this restriction
it is not possible to construct a satisfactory structure. For example,
if the coordinates for the hydrogen atom are obtained by measurement
of the drawings of the 2₇ structure of Bragg, Kendrew and Perutz,⁵ the
N-H···O angle differs from a straight angle by 66°, and the angle
between the N-H and N···O vectors is 48°. A second example of an
unacceptable structure for the α_II ribbon (with a twofold screw axis) is
that of Robinson and Ambrose,⁹ who investigated various chains with
their space filling models. Although no coordinates were given, it is
possible to derive from the data they do quote that the N-H···O angle
differs from a straight angle by about 60°, the angle between the N-H
and N···O vectors is about 40°, and, moreover, the N···O distance is
about 2.56 Å, more than 0.1 Å shorter than any reliable value so far
reported.

A 3₁₀ helix which deviates widely from the structural postulates has
been put forward previously by Huggins.⁶ Here again it is difficult to
determine the precise characteristics of this model, since no atomic co-
ordinates were given, but it is clear from the drawings of it that the peptide
groups deviate so far from planarity that it must be considered unaccept-
able. Furthermore, 5.1 Å was given as the axial repeat for 3 residues, as
compared with 6.0 Å for the present structure, and it has been found that
the axial repeat of the 3.0₁₀ helix may not be shortened by as much as
0.2 Å (for 3 residues) without either leading to an improbably short
N-H···O distance or introducing other undesirable features.

A helix with R = 14 does not appear to have been described before.
Bragg, et al.,⁵ state that for 3₁₄ and 4₁₄ there are “no possible structures,”
but these workers restricted themselves to integral values of S, the screw
symmetry.

There are several short distances involving the two possible positions
for the β-carbon atoms. In the 2.2₇ helix the distance from β-carbon-2 to
the next oxygen atom above it is only 2.6 Å, and in the 4.3₁₄ and 5.1₁₇
(γ) structures the distance from β-carbon-2 to the next oxygen atom
below is even shorter, a situation which may eliminate these choices for
the position of the β-carbon atoms. In the 3.6₁₃ (α) and 4.4₁₆ struc-
tures the distances from β-carbon-2, and in the 2.2₇ structure the distance
from β-carbon-1, to the next oxygen below is 2.7 Å, not far from the
C···O distances observed in acetylglycine,¹³ glutamine,¹⁴ glycylglycine,¹⁵
and glycylasparagine,¹⁶ where C···O distances of 2.69 Å, 2.76 Å, 2.78 Å,
and 2.81 Å respectively, are found. All these are close to what is expected
when the βC-C bond, or C-N bond in the case of the peptides, is cis to
the C′-O bond. (In the 3.0₁₀, 4.3₁₄, and 5.1₁₇ helices this distance is 2.8 Å
or more, since the orientation of the βC-C bonds is 20° or greater from
cis.) A preference between the two β-carbon positions cannot, therefore,
be made on this basis. The position of the R groups in figures 1 to 4 cor-
responds to β-carbon-1.

It is a matter of some interest to compare the relative stabilities of the
six helices. Five features of the structures will be examined: (1) distor-
tion from planarity of the peptide residues; (2) deformation in bond angles;
(3) van der Waals contacts; (4) the orientations about the single bonds,
C-C′ and C-N; and (5) nature of the hydrogen bond. In all models
the covalent bond distances are within ±0.02 Å of the predicted values,⁴
which in any case, are accurate to only about that amount.

Quantitative estimates of the strain energies are made, where possible,
as follows: (1) the strain energy due to a rotation of a peptide residue about
its C′-N bond is given by the formula’ ¹⁸ A sin² δ where δ = the angle of
rotation and A = 30 kcal. per mole; (2) for bending of single covalent
bonds, the strain energy is calculated by assuming that the bond energy
is proportional to the strength of the bond orbital in the bond direction;¹⁸
thus, for the case of distortion of a bond angle this leads to the formula
1.5 B sin² δ′/4 where δ′ is the deviation of the bond angle from the value
expected for maximum bond strength and B is the sum of the bond energies
of the two bonds forming the strained angle; (3) quantitative estimation
of van der Waals forces, either repulsive or attractive, has not been at-
tempted but rather the van der Waals contacts in the different structures
are listed for comparison with each other and with other known structures;
(4) according to Pauling and Corey²⁰ there are expected to be six favored
orientations about each of the C-C′ and C-N bonds, each differing by
60°. The energy difference between a most favorable and most unfavor-
able position has been estimated at about one kcal. per mole, and may be
calculated for intermediate positions by assuming that the potential
function is a sine wave; (5) the energy”® of an N-H···O hydrogen bond
is of the order of 8 kcal. per mole if the angle between the N-H and
N···O vectors, i.e. ∠H-N···O, is close to 0°. For values of this angle
larger than 30° the energy is assumed to be negligibly small. It is not
known in what way the energy decreases as the angle increases.
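Estimates (1) and (4) can be evaluated numerically. The sketch below uses the expression A sin² δ with A = 30 kcal. per mole as given in the text; the sinusoidal form chosen for the orientation term, (ΔE/2)(1 − cos 6θ) with ΔE = 1 kcal. per mole and θ measured from a favored orientation, is an assumption consistent with six favored positions 60° apart and is not taken from the source. Function names are illustrative.

```python
import math

A_KCAL = 30.0  # kcal per mole, the value of A quoted in the text


def twist_strain(delta_deg, a=A_KCAL):
    """Strain energy A * sin^2(delta) for a rotation delta about the C'-N bond."""
    return a * math.sin(math.radians(delta_deg)) ** 2


def orientation_strain(offset_deg, barrier=1.0, period_deg=60.0):
    """Assumed sine-wave potential for orientation about a single bond.

    offset_deg is the departure from a favored orientation; the energy is
    zero at favored positions and equals `barrier` midway between them.
    """
    phase = 2.0 * math.pi * offset_deg / period_deg
    return 0.5 * barrier * (1.0 - math.cos(phase))


print(round(twist_strain(45.0), 2))   # rotation quoted for the R = 8 structure
print(round(twist_strain(20.0), 2))   # rotation quoted for the R = 11 structure
print(round(orientation_strain(30.0), 2))
```

A rotation of 45° gives the 15 kcal. per mole per residue quoted for the R = 8 structure; a rotation of 20° contributes about 3.5 kcal., the remainder of the strain in the R = 11 structure arising from the deformed bond angle.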

The results of the above considerations are presented in table 2. The
hydrogen bonds are seen to be satisfactory in all six, the greatest deviation
from the ideal linear configuration being shown by the 2.2₇ structure.
Although an angle of as much as 24° between the N-H and N···O
vectors is not without precedent, it may be significant that this value is
about as good as can be achieved for the 2.2₇ helix, whereas in crystal
structures in which a similar deviation occurs there is always at least one
other hydrogen bond of a more satisfactory nature. Omitting from
consideration the van der Waals forces and hydrogen bond energies, it is
seen that if the quantitative estimates are correct, the stability of the six
structures is in the order 3.6₁₃, 4.4₁₆ (0.5), 2.2₇ (0.5), 3.0₁₀ (1.0), 5.1₁₇ (2.0),
and 4.3₁₄ (2.4), where the numbers in parentheses give the estimated
instability in kcal. per mole per residue. Absence of appreciable van der
Waals attractive forces in the 5.1₁₇ helix would lead to additional insta-
bility.¹⁸,²⁰ This effect also operates to a lesser extent with respect to the
2.2₇, 4.3₁₄, and 4.4₁₆ structures, although this disadvantage might be over-
come by the 2.2₇ helix because its flat character may enable two of them to
pack well side by side. The 3.0₁₀ helix is the tightest of the six, and it is
possible that the contacts are so close that the repulsion forces counteract
the attractive forces to such an extent as to render it unstable, although
none of the distances is unreasonably short in comparison with known
structures. Van der Waals radii, particularly for intramolecular distances,
are not accurate enough to settle the question. It seems likely that the
strain associated with the 4.3₁₄ helix is great enough to render its occur-
rence improbable. The values 3° for the twist in the peptide group and
100° for the angle C′-C-N represent the configuration of minimum
strain energy for simultaneous variation of these parameters. It may
be pointed out, on the other hand, that angles at a singly bonded carbon
atom of as low as 104° have been observed in threonine²¹ and as high as
115° in glutamine,¹⁴ values not far from those in the 4.3₁₄ and 4.4₁₆ struc-
tures. It may well be possible that a combination of a favorable hydrogen
bond situation plus unknown external influences may allow polypeptide
chains to overcome small strains and to fold in one or another of the
configurations which, at first glance, seem to be rather less reasonable
than the 3.6₁₃ (α) helix.

TABLE 2
ESTIMATED RELATIVE STABILITIES OF SIX HELICAL POLYPEPTIDE STRUCTURES
(Rows: deviation of residues from planarity; deformation of bond angles;
van der Waals contacts of 3.2 Å or less; orientation about the C-N bond and
about the C-C′ bond; nature of the hydrogen bond, namely distance N···O and
angle H-N···O. Energy differences are given in kcal. per mole per residue.
The entries for the individual helices are illegible in this copy.)

TABLE 3
STRUCTURAL CHARACTERISTICS OF HELICAL POLYPEPTIDE STRUCTURES

                                                       ESTIMATED MINIMUM
                    RESIDUES     AXIAL TRANSLATION     INSTABILITY PER
ATOMS IN RING       PER TURN     PER RESIDUE           MOLE PER RESIDUE
 7 (α_II ribbon)    2.2          2.75 Å                0.5 kcal.
10                  3.0          2.00                  1.0
13 (α)              3.6          1.50                  0
16 (π)              4.4          1.15                  0.5
 8                  No possible structure
11                  No possible structure
14                  4.3          1.20                  2.4
17 (γ)              5.1          0.98                  2.0

It is interesting that the four most stable helices are all members of the 
family R = 3n + 4, and it is possible to pass from one structure to the 
next by moving the hydrogen bonds along the chain. The structural 
characteristics of the various chains of both families are summarized in 
table 3. 

Radial distribution functions and x-ray scattering characteristics of 
these helical structures will be discussed in a subsequent paper. 


Acknowledgment.—This work was supported by a Grant of a Fellowship
from the John Simon Guggenheim Memorial Foundation. 


¹ Pauling, L., and Corey, R. B., J. Am. Chem. Soc., 72, 5349 (1950).

² Pauling, L., Corey, R. B., and Branson, H. R., these PROCEEDINGS, 37, 205 (1951).

³ Pauling, L., and Corey, R. B., Ibid., 37, 235 (1951).

⁴ Corey, R. B., and Donohue, J., J. Am. Chem. Soc., 72, 2899 (1950).

⁵ Bragg, W. L., Kendrew, J. C., and Perutz, M. F., Proc. Roy. Soc., A203, 321 (1950).

⁶ Huggins, M. L., Chem. Rev., 32, 195 (1943).

⁷ Low, B. W., and Baybutt, R. B., J. Am. Chem. Soc., 74, 5806 (1952).

⁸ Private communication from Dr. Low.

⁹ Robinson, C., and Ambrose, E., Trans. Far. Soc., 48, 854 (1952).

¹⁰ Huggins, M. L., J. Am. Chem. Soc., 74, 3963 (1952).

¹¹ Huggins, M. L., Ibid., 74, 3963 (1952).

¹² Pauling, L., and Corey, R. B., Ibid., 74, 3964 (1952).

¹³ Carpenter, G. B., and Donohue, J., Ibid., 72, 2815 (1950).

¹⁴ Cochran, W., and Penfold, B. R., Acta Cryst., 5, 644 (1952).

¹⁵ Hughes, E. W., and Moore, W. J., J. Am. Chem. Soc., 71, 2618 (1949).

¹⁶ Katz, L., Pasternak, R., and Corey, R. B., Nature, 170, 1066 (1952).

¹⁷ Pauling, L., and Corey, R. B., these PROCEEDINGS, 37, 251 (1951).

¹⁸ Corey, R. B., and Pauling, L., Proc. Roy. Soc., B141, 10 (1953).

¹⁹ Donohue, J., and Trueblood, K. N., Acta Cryst., 5, 419 (1952).

²⁰ Pauling, L., and Corey, R. B., these PROCEEDINGS, 37, 729 (1951).

²¹ Shoemaker, D. P., Donohue, J., Schomaker, V., and Corey, R. B., J. Am. Chem.
Soc., 72, 2328 (1950).





VoL. 39, 1953 GENETICS: GILES AND PARTRIDGE 479 


THE EFFECT OF A SUPPRESSOR ON ALLELIC INOSITOLLESS 
MUTANTS IN NEUROSPORA CRASSA* 


By NORMAN H. GILES AND C. W. H. PARTRIDGE
OSBORN BOTANICAL LABORATORY, YALE UNIVERSITY 
Communicated by P. R. Burkholder, March 30, 1953 


Extensive analyses of spontaneous and radiation-induced reversions in a 
series of independently induced allelic inositolless mutants of Neurospora 
crassa have revealed that most such inositol-independent types are the re- 
sult of reverse mutation.'* Furthermore, evidence was obtained that the 
inositolless mutants, which are phenotypically and biochemically indis- 
tinguishable," * can be separated into several groups on the basis of quan- 
titative differences in their spontaneous or induced reverse mutation rates. 
In addition to typical revertants to inositol-independence by reverse muta- 
tion, occasional phenotypically atypical revertants were recovered which, on 
crossing to wild type, segregated inositolless cultures in random ascospore 
isolation tests, indicating the probable occurrence of occasional suppressor 
mutations. The present paper deals with a more detailed genetic anal- 
ysis of one such suppressor type. In addition, tests of the effect of this 
suppressor on other inositolless mutants have been made. These studies 
demonstrate that the suppressor is highly specific, since it apparently does 
not suppress any of the other mutants. Consequently, these results serve 
to establish a further difference between the suppressed inositolless mutant 
and all other inositolless mutants tested to date. 

Origin, Genetic Analysis, and Biochemical Characteristics of the Sup-
pressor Mutant.—The mutant described in these studies appeared following
platings on minimal (inositol-free) medium of microconidia of inositolless
strain 37401 (Stanford number) which had been exposed to ultraviolet.
The mutant was initially noted as growing more slowly on minimal than the 
usual reverse mutation types. Even though the reversion had arisen in a 
microconidial strain, the mutant was next back-crossed to mutant 37401 
and an inositol-independent isolate recovered in order to be certain that 
all subsequent tests would be performed with a homocaryotic strain. 
This extracted inositol-independent culture was then crossed to a wild- 
type inositol-independent strain and serial ascospore isolations made and 
tested, with the results indicated in table 1. The recovery of inositolless 
types as indicated by the presence of 4:4 and 6:2 ascus segregations demon-
strates that the mutant being studied did not arise from reverse mutation,
but rather was the result of mutation at some other locus. Such types in 
Neurospora have been designated, following the Drosophila terminology, 
suppressor mutants.‘ 
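The three segregation patterns follow from the assortment of two loci in an eight-spored ascus. The sketch below is illustrative only: it assumes that the suppressor locus assorts independently of the inositolless locus, that any culture growing without inositol is scored as inositol-independent, and it uses the standard tetrad terms (parental ditype, nonparental ditype, tetratype), which do not appear in the text. The cross represented is (inos, Su) by (+, +).

```python
def is_independent(inos_allele, su_allele):
    """A culture grows without inositol if it is wild type at the
    inositolless locus or if it carries the suppressor."""
    return inos_allele == "+" or su_allele == "Su"


# Genotypes present in each ascus type for the cross (inos, Su) x (+, +).
ASCUS_TYPES = {
    "parental ditype": [("inos", "Su"), ("+", "+")],
    "nonparental ditype": [("inos", "+"), ("+", "Su")],
    "tetratype": [("inos", "Su"), ("inos", "+"), ("+", "Su"), ("+", "+")],
}


def segregation(genotypes, spores=8):
    """Return (inositol-independent, inositolless) spore counts for one ascus."""
    copies = spores // len(genotypes)
    independent = sum(copies for g in genotypes if is_independent(*g))
    return independent, spores - independent


for name, genotypes in ASCUS_TYPES.items():
    print(name, "%d:%d" % segregation(genotypes))
```

On these assumptions only the 4:4 and 6:2 asci contain inositolless spores, which is the observation used in the text to exclude reverse mutation at the original locus.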

The recovery of the three segregating types indicated in table 1 pro-
vides material for comparing the growth characteristics of the suppressor
with the original inositolless mutant and with wild type (inositol-inde- 
pendent). These also furnish cultures in which the suppressor is present 
with the wild type allele at the inositolless locus. Growth tests of a series
of such cultures have been carried out (at 25°C.) in 125-ml. Erlenmeyer
flasks, using minimal medium and minimal supplemented with 4 µg.
inositol per ml. Pads were removed at 3, 6, and 8 days for dry weight
determinations, three flasks being used to provide an average value for
each determination. Growth curves for asci exhibiting 6:2, 8:0, and 4:4 
segregations are presented in figures 1, 2, and 3, respectively. The most 
instructive segregation is that shown in figure 1, the 6:2 type. In all 
instances the indicated genotypes were confirmed by appropriate crossing 
tests. It is immediately clear that all four genotypes are phenotypically 
distinguishable. It is also evident from the growth response in the culture 
from spore 16.3 that the suppressor only partially suppresses the effect 
of the inositolless mutation in 37401, since growth of the suppressor in the 
absence of inositol is considerably less than that obtained in the inositol-
independent wild type. Further, added inositol markedly stimulates the
growth of the suppressor. The growth response of the suppressor tends
to be rather variable when different stocks are compared in simultaneous
experiments (e.g., compare spore cultures 6.1, 6.3, and 16.3), perhaps due to
segregating modifiers. However, there is also a rather considerable vari-
ability when the same stock is used in successive experiments. But in no
instance has the growth of a suppressor strain closely approached that of
wild type.

TABLE 1
RESULTS OF A CROSS OF REVERSION (HOMOCARYOTIC CULTURE EXTRACTED FROM A
BACK CROSS TO 37401) WITH WILD TYPE (INOSITOL-INDEPENDENT)

ASCUS SEGREGATION PATTERN                 NUMBER OF COMPLETE
INOSITOL-INDEPENDENT : INOSITOLLESS       ASCI TESTED
4:4                                       2
6:2                                       8
8:0                                       2

In addition to the suppressor which is segregating with inositolless mu- 
tant 37401 in spore 16.3, two additional cultures of the expected types, 
inositolless 37401 alone (spore 16.5) and inositol-independent (spore 16.7), 
are present. The fourth culture, from spore 16.1, in which the suppressor 
is present with wild type, is phenotypically distinct and exhibits a growth 
pattern which could not have been predicted in advance. It is evident, 
however, from culture 16.1 as well as from cultures 15.5 and 15.7 (figure 3), 
that the presence of the suppressor results in a marked delay in the growth 
on minimal of the wild type strain and that this delay is not overcome by 
the addition of inositol. 





Microbiological assays have indicated that the suppressor culture is 
actually synthesizing meso-inositol, but apparently at a much reduced rate 
compared with the inositol-independent wild type. Quantitative assays 
have been performed with mutant 37401 of Neurospora crassa and with 
Ashbya gossypi, for the first of which a marked specificity of response to 
meso-inositol apparently exists.⁵ Following 21 days’ growth on minimal,
the mycelium of a suppressor strain was harvested (total dry weight was
74 mg.) and both filtrate and mycelium were tested for inositol activity.
No activity was present in the filtrate, but the following values were ob-
tained in micrograms of inositol per mg. of dried mycelium in duplicate
sample tests: with the Neurospora assay, 0.19 and 0.14; with the Ashbya
assay, 0.17 and 0.24. The value obtained for wild type mycelium in com-
parable assays is approximately 2.0, indicating that the suppressor has
only about one-tenth the mycelial concentration of inositol present in the
wild type.

FIGURE 1
(Growth curves, dry weight in mg. against days, in minimal and in minimal +
inositol. Panels: spore 16.1 (W.T. + Su.); spore 16.3 (inos-37 + Su.);
spore 16.5 (inos-37); spore 16.7 (W.T.).)

Figures 1-3 show comparative growth curves obtained in liquid minimal and minimal
supplemented with 4.0 µg. inositol per ml. (20 ml. in 125-ml. Erlenmeyer flasks) for
cultures from complete asci from a cross of the suppressor of inositolless 37401 (inos
37 + Su.) with wild type (W. T.). Segregations of inositol-independent : inositolless
cultures are as follows: figure 1, 6:2; figure 2, 8:0; figure 3, 4:4.

TABLE 2
TESTS FOR SUPPRESSION OF OTHER INOSITOLLESS MUTANTS BY A SUPPRESSOR OF
INOSITOLLESS 37401 (37401 + Su.)

Columns: crosses to inositolless mutants; number of complete asci with
segregation types inositol-independent : inositolless of 4:4, 6:2, and 8:0.
(A) Wild type + suppressor, culture No. 1, crossed with: 37102, 16316, 1802,
64001, 89601, 37401, J.H. 2626, J.H. 5202.
(B) Wild type + suppressor, culture No. 2, crossed with: 89601, 37401.
Footnote: 8-spored asci only.
(The ascus counts are illegible in this copy.)

FIGURE 2
(Growth curves, dry weight in mg. against days, in minimal and in minimal +
inositol. Panels: spore 6.1 (inos-37 + Su.); spore 6.3 (inos-37 + Su.);
spore 6.5 (W.T.); spore 6.7 (W.T.).)
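The estimate of about one-tenth can be checked from the four assay values. The sketch below simply averages the duplicate Neurospora and Ashbya determinations and divides by the approximate wild type value; variable names are illustrative, and pooling the two assays into one mean is an assumption made here.

```python
from statistics import mean

# Micrograms of inositol per mg. of dried mycelium, suppressor strain.
neurospora_assay = [0.19, 0.14]
ashbya_assay = [0.17, 0.24]
wild_type_value = 2.0  # approximate value quoted for wild type mycelium

suppressor_mean = mean(neurospora_assay + ashbya_assay)
ratio = suppressor_mean / wild_type_value

print(round(suppressor_mean, 3), round(ratio, 3))
```

The pooled mean is 0.185 µg. per mg., or a little over nine per cent of the wild type value.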

Genetic Tests of the Suppressor in Combination with Other Inosttolless 
Mutants.—Following the demonstration that the reversion in 37401 was 
actually the result of a suppressor mutation at an independent locus, it 
became of interest to determine whether this suppressor would also sup- 
press other inositolless mutants. Consequently, simultaneous crosses 
were made between the suppressor combined with wild type (W.T. +Su.) 


MINIMAL 
MINIMAL + INOSITOL 


SPORE I5.1(INOS-37) SPORE 15.5 (wT+$u) 


ee re re en 


i i l 





ro) 
= 
— 
x 
2 
WwW 
= 
> 
x 
a 


SPORE 15.3 (INOS-37) SPORE (5.7 (w.7.48U,) 











DAYS 
FIGURE 3 

and several different inositolless mutants. Serial ascospore isolations were 
then performed to test for the presence of 6:2 and 8:0 inositol-independent : 
inositolless segregations, since more than four inositol-independent cultures 
in a single ascus would indicate a suppression of the particular inositolless 
mutant being tested by the suppressor of mutant 37401. Some of these 
crosses were also segregating for albino (al-15300) which facilitates the 
visual detection of any segregation irregularities. 
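The reasoning behind the 4:4, 6:2 and 8:0 classes can be made explicit by enumerating the three tetrad types expected when the suppressor locus assorts independently of the inositolless locus. The following is a minimal sketch under stated assumptions: the genotype labels and function names are hypothetical, each meiotic product is taken to give a pair of identical spores, and a strain carrying the suppressor is counted as inositol-independent even though it grows slowly.

```python
# Hypothetical genotype labels for this sketch:
#   first element:  "+"    = wild type allele at the inositolless locus
#                   "inos" = inositolless mutant allele
#   second element: "Su"   = suppressor present, "su+" = suppressor absent
PARENT_A = ("+", "Su")       # wild type carrying the suppressor (W.T. + Su.)
PARENT_B = ("inos", "su+")   # inositolless mutant without the suppressor


def tetrad_products(kind):
    """Return the four meiotic products of a PD, NPD or T tetrad."""
    recomb_1 = (PARENT_A[0], PARENT_B[1])  # + ; su+    (plain wild type)
    recomb_2 = (PARENT_B[0], PARENT_A[1])  # inos ; Su  (mutant plus suppressor)
    if kind == "PD":    # parental ditype
        return [PARENT_A, PARENT_A, PARENT_B, PARENT_B]
    if kind == "NPD":   # non-parental ditype
        return [recomb_1, recomb_1, recomb_2, recomb_2]
    if kind == "T":     # tetratype
        return [PARENT_A, PARENT_B, recomb_1, recomb_2]
    raise ValueError(f"unknown tetrad type: {kind}")


def grows_without_inositol(genotype, suppressible):
    """True if a culture of this genotype grows on minimal medium."""
    locus, su = genotype
    if locus == "+":
        return True
    # A mutant allele grows only if it carries the suppressor AND responds to it.
    return su == "Su" and suppressible


def ascus_ratio(kind, suppressible):
    """Return the inositol-independent : inositolless ratio for an 8-spored ascus."""
    products = tetrad_products(kind)
    independent = 2 * sum(grows_without_inositol(g, suppressible) for g in products)
    return f"{independent}:{8 - independent}"


if __name__ == "__main__":
    for suppressible in (True, False):
        ratios = {k: ascus_ratio(k, suppressible) for k in ("PD", "T", "NPD")}
        print("suppressible mutant" if suppressible else "non-suppressible mutant", ratios)
```

With a suppressible mutant the three tetrad types give 4:4, 6:2 and 8:0; with a mutant that does not respond to the suppressor every tetrad type gives 4:4, which is why the appearance of any 6:2 or 8:0 ascus is diagnostic.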

The results of these crosses, shown in table 2, indicate that only 4:4 
segregations were obtained with seven different inositolless mutants 







(although relatively few complete asci were obtained with mutants 5202
and 37102), whereas mutant 37401 again showed the expected three types.
Typical growth curves obtained from the four cultures in one ascus (No. 3)
from a cross with mutant 89601 are shown in figure 4.

In order to be certain that the suppressor was actually present, even 
though inactive, with the various inositolless mutants tested, further 


[FIGURE 4. Comparative growth curves obtained by methods indicated in
figures 1-3, but from a cross of the suppressor of 37401 combined with wild
type (W.T. + Su.) with inositolless 89601 (inos 89). Cf. table 3. Panels:
spore 3.1 (W.T.), spore 3.5 (W.T. + Su), spore 3.4 (inos-89), spore 3.7
(inos-89 + Su). Axes: dry weight (mg.) against days; curves for minimal and
minimal + inositol.]


crosses to 37401 were made utilizing the four cultures from ascus 3 in the
cross with 89601. The results of these crosses are shown in table 3. It can
be seen from the ascus segregation patterns that the suppressor can be
re-extracted from mutant 89601 and is again able to suppress mutant 37401,
although it did not effect growth in the absence of inositol when combined
with mutant 89601.

Discussion.—The particular reversion selected for study in the present 







investigation was obtained as a phenotypically distinctive, slow-growing
mutant from inositolless microconidia (of mutant strain 37401) exposed to
ultraviolet irradiation and plated on minimal medium. Extensive genetic
analyses of phenotypically “normal” reversion types—those exhibiting a
morphology and growth rate on minimal medium similar to
inositol-independent, “wild type” strains*—have demonstrated that the vast
majority of these behave in crosses as instances of reverse mutation.
Preliminary evidence that certain of the slow-growing types represent
instances of suppressor mutation rather than of reverse mutation was
obtained when crosses of certain of these mutants to wild type yielded
inositolless cultures from random ascospore isolations.” The present
investigation gives detailed genetic evidence from serial ascospore
isolations for one such mutant and demonstrates conclusively that reversions
may be obtained which are able to grow in the absence of inositol as a
result of suppressor mutation at a locus completely independent of the
original inositolless locus.


TABLE 3

TESTS TO DEMONSTRATE THE PRESENCE OF THE SUPPRESSOR OF 37401 IN COMBINATION
WITH INOSITOLLESS MUTANT 89601. CROSSES OF ONE CULTURE FROM EACH SPORE
PAIR IN A COMPLETE ASCUS ISOLATED FROM A CROSS OF WILD TYPE PLUS SUPPRESSOR
WITH 89601-al₂ (CF. FIG. 4)


                              NO. OF COMPLETE ASCI OF SEGREGATION TYPES
                              INOSITOL-INDEPENDENT : INOSITOLLESS
SPORE NO.   CROSSED WITH      4:4   0:8   2:6   6:2     AL SEGREGATION   INDICATED GENOTYPE OF SPORE

3.1         37401-al₂                                                    W.T.
3.4         37401-al₂                                                    inos 89
3.5A        37401-al₂                                                    W.T. + Su
3.7A        37401-al₂                                                    inos 89 + Su


Growth tests of the present suppressor indicate that this mutation only
partially suppresses the effect of the original mutation giving an inositolless
phenotype. Although growth occurs in the absence of this substance, it is
much less rapid than in inositol-independent strains (either wild type or
typical reverse mutation types), and growth is markedly stimulated by
the addition of inositol to the medium. The possibility that some change
has occurred as a result of the suppressor mutation such that inositol is no
longer required by the reversion, or is being replaced by some other
substance, appears to be eliminated by the demonstration by microbiological
assay that meso-inositol is present in the mycelium of the suppressor
strain, although in a reduced amount compared with typical
inositol-independent types. This latter condition presumably explains the
relatively restricted growth of the suppressor on minimal and its
stimulation by added inositol.







Perhaps the most notable finding in the present study is the apparent 
marked specificity of this suppressor strain. Crosses of the suppressor in 
combination with the wild type allele at the inositolless locus were made to 
seven allelic inositolless mutants (all of independent origin). A simul- 
taneous cross was made to mutant 37401, in which the suppressor was ob- 
tained, as a control. The expected 6:2 and 8:0 inositol-independent : 
inositolless segregations were recovered in serial ascospore isolations from 
the control cross, but in all the other crosses only 4:4 segregations were 
observed. Appropriate additional genetic tests with one of the mutants 
(89601), involving re-extraction of the suppressor, demonstrated that the
suppressor was definitely present, but that no growth on minimal occurred 
even in protracted growth tests in tubes and in flasks. 

Previous studies of the same series of inositolless mutants used in the 
present tests for suppressor activity have demonstrated that these mutants 
are all allelic (or in some instances possibly pseudoallelic), but that several 
differ markedly in at least one respect—the frequency with which they 
undergo reverse mutation spontaneously or following ultraviolet radiation. * 
With respect to its frequency of reverse mutation induced by ultraviolet, 
mutant 37401 appears to be distinct from the other seven inositolless types, 
and the present evidence for the specificity of its suppressor adds another 
criterion of distinction. The evidence obtained from this suppressor does 
not, however, aid in elucidating the problem of possible pseudoallelism at 
this locus. The best evidence to date for recombination is that between 
mutants 37401 and 64001, from crosses of which inositol-independent 
cultures (which have been shown not to be pseudo-wild types⁶) are regularly
recovered.” On the other hand, no evidence of recombination has been 
obtained from crosses of 37401 and 37102, involving tests of comparable 
numbers of ascospores. With both mutants, however, there is no effect 
of the suppressor of 37401. 

A similar specificity of suppressor effect in Neurospora crassa for one of 
two apparently allelic mutants of independent origin has been reported by 
Yanofsky.⁷ In this instance it was shown that the particular enzyme,
tryptophan desmolase, which was absent from the original mutants, was 
present in the suppressed mutant, but could not be detected in the mutant 
which did not grow without tryptophan when carrying the suppressor. 
An additional noteworthy similarity exists between the results reported for 
the tryptophanless mutant and those obtained in the present study. 
When the suppressor mutation is combined with the wild type allele at the 
inositolless locus, a marked interaction occurs such that the growth of the 
resulting strain is greatly retarded compared with the normal wild type. 
Further, this effect cannot be overcome by added inositol. The nature of 
this interaction is not clear. At present it does not appear particularly 


profitable to indulge in speculations as to the biochemical mechanism by 







which the present suppressor is able to overcome, at least partially, the 
effect of mutation at the original inositolless locus. Certain aspects of this 
general problem have been considered for the tryptophanless suppressor,⁷
a situation in which the biochemical evidence is much more extensive than 
in the present case. 

The present instance of a suppressor which is effective in combination
with only one of a series of allelic (or in some instances possibly
pseudoallelic) mutants provides additional evidence that similarity of
response by two or more mutants in Neurospora to a particular suppressor
cannot be used as a criterion of allelism. Although the original
investigation of suppressors in Neurospora⁴ suggested that such mutants
might be useful in tests for allelism, it is now apparent that the effects of
suppressor mutation may be more complex than was previously anticipated.*
In addition to instances such as the present case, in which allelic mutants
are not all affected by the same suppressor, instances are also known in
Neurospora in which clearly non-allelic mutants are suppressed by the same
suppressor.” 8, 9

Other instances of reversions are known in Neurospora inositolless
mutants which behave in preliminary tests like suppressors, and at least some
of those in strain 37401 appear to be different from the present mutant.
Additional investigations are contemplated to determine if genetically
distinct suppressors do occur. Such tests may make it possible to establish
other criteria for distinguishing among the apparent isoalleles at the
inositolless locus, much as has been done recently at the vermilion locus in
Drosophila melanogaster.¹⁰

Summary.—A genetic analysis has been made of a slow-growing
reversion obtained from platings on minimal medium of ultraviolet-treated
microconidia of inositolless mutant 37401 in Neurospora crassa. These
tests indicate that the reversion arose as a result of an independent
suppressor mutation rather than by the more usual reverse mutation. The
resulting suppression of the inositolless phenotype is only partial,
however, since the suppressor grows much less in the absence of inositol than do
typical inositol-independent strains and is markedly stimulated by the
addition of inositol. Microbiological assays indicate that inositol is present
in the mycelium of the suppressor but in reduced amounts compared with
wild type strains. Genetic tests have been made to determine the effect
of the suppressor of mutant 37401 on seven other allelic (or possibly
pseudoallelic) inositolless mutants of independent origin. The results
indicate that the suppressor is markedly specific for mutant 37401, since
growth does not occur in the absence of inositol when the suppressor is
combined with any of the other mutants.


Acknowledgment.—The authors desire to express their thanks to Mrs. 





488 GENETICS: YOUNG ET AL. Proc. N. A. S.


Norman Giles and to Miss Mary Case for their assistance in this investiga- 
tion. 


* This research was supported in part by a research contract with the Atomic Energy
Commission.

¹ Giles, N. H., Jr., and Lederberg, E., Amer. Jour. Bot., 35, 150-157 (1948).

² Giles, N. H., Jr., Cold Spr. Harb. Symp. Quant. Biol., 16, 283-313 (1951). Also
unpublished material.

³ Partridge, C. W. H., and Giles, N. H., unpublished.

⁴ Houlahan, Mary B., and Mitchell, H. K., Proc. Nat. Acad. Sci., 33, 223-229
(1947).

⁵ Schopfer, W. H., Bull. Soc. Chimie biol., 33, 1113-1146 (1951).

⁶ Mitchell, M. B., Pittenger, T. H., and Mitchell, H. K., Proc. Nat. Acad. Sci., 38,
569-580 (1952).

⁷ Yanofsky, C., Ibid., 38, 215-226 (1952).

⁸ Mitchell, Mary B., and Mitchell, H. K., Ibid., 38, 205-214 (1952).

⁹ Lein, J., and Lein, P. S., Ibid., 38, 44-48 (1952).

¹⁰ Green, M. M., Ibid., 38, 300-305 (1952).


THE EFFECT OF PRETREATMENT WITH INFRA-RED 
RADIATION ON THE X-RAY INDUCED SEX-LINKED 
RECESSIVE LETHAL AND VISIBLE MUTATION RATE IN 
DROSOPHILA MELANOGASTER 


By WILLIAM J. YOUNG,* HENRY T. YOST, JR., P. T. IVES, AND R. PAUL
LEVINE†
BIOLOGICAL LABORATORY, AMHERST COLLEGE, AMHERST, MASSACHUSETTS
Communicated by A. F. Blakeslee, April 16, 1953


Pretreatment with infra-red radiation has been shown to cause an
increase in the number of gross chromosomal aberrations induced by
x-radiation in Drosophila;¹ however, no increase in the number of recessive
lethals has been found with similar treatment.² These findings are of interest,
since recessive lethals have been thought to arise, in large part, as a direct
consequence of chromosome breakage.³⁻⁵ Kaufmann and Gay² have
explained these results by proposing that rearrangements do not give rise
to any large portion of recessive lethals, and that the infra-red acts to
sensitize the breakage ends of the chromosome so that recombination results
rather than restitution. Thus there is an increase in the number of gross
chromosomal rearrangements without an increase in the total number of
primary breaks, and, consequently, there is no increase in the number of
recessive lethals. The data of Kaufmann and Gay are in general
agreement with the hypothesis of Lea and Catcheside.⁴

Difficulties of interpretation arise when the foregoing arguments are 







considered in the light of the data obtained with Tradescantia, using
combined infra-red and x-radiation. It has been shown that all types of
chromosome aberration are increased by infra-red pretreatment and that the
total number of primary breaks is increased.⁶, ⁷ If the mechanism of action
of infra-red be the same in both forms, and if the recessive lethals be directly
related to chromosome breakage, there should be an increase in recessive
lethals with infra-red pretreatment. Since this discrepancy existed, it
was felt that further investigation of the recessive lethal problem would be
worth while.

Recently two factors which modify the action of infra-red radiation have
been studied. It has been shown that the temperature at the time of
radiation is critical,⁶ and that the intensity of infra-red radiation may be of
importance.⁷ Swanson (personal communication) has demonstrated that
the most effective temperature for Tradescantia is about 12°C., and the
maximum temperature is about 22°C. It was felt that the effective
temperature for Drosophila might be lower than that previously tried (18°C.)¹
and that an increase in intensity of infra-red might influence the rate of
recessive lethal induction. Experiments were undertaken to test these
points.

At the same time, a weakness in the test for recessive lethals was
apparent. The induction of more than one lethal on the X-chromosome would
be missed. Since the x-ray and infra-red are additive, and since there is
usually found a large increase in the number of rearrangements, it seemed
likely that a chromosome which had been “hit” by x-radiation (that is, had
one primary break) would be likely to have more than one break if
previously treated with infra-red radiation. For this reason attempts were
made to determine the number of lethals on some of the tested
X-chromosomes by studies of crossing over. Further, it was hoped that the use of
visible mutations might help to elucidate the problem of multiple events
and at the same time give an independent estimate of the induced
mutations.

Experimental Procedures and Results.—A strain of Oregon-R, inbred for
70 generations to insure maximum homozygosis,⁸ was used as the
experimental stock. During the course of the experiment the stock was kept in
bottle cultures without selection or inbreeding. Males to be treated were
selected between two and seven days after eclosion, to minimize the
variation in the natural mutation rate.⁹ The males were placed in shell vials
and allowed to equilibrate to the cold-room temperature (15°C.) before
immersion in the water bath, which averaged 8°C. At this temperature the
flies sit motionless on the food so that by slanting it all the flies are exposed
quite readily to the radiation. The vials were immersed in the water bath
during the infra-red treatment. Control vials were placed in the same
bath, protected from the radiation by a black rubber covering. The
temperature of the water bath at the end proximal to the radiation source was
found to increase less than 0.5°C. over that at the control end, 37 cm.
distant. The exposed vials were held within 5.5 cm. of the Corning No. 2540
filter, through which all radiation passed. The infra-red source was a
General Electric 250-watt heat lamp; the distance from the source to the
treated flies was about 13 cm. This distance is less than that used by
Kaufmann, et al.,¹ and therefore the infra-red radiation was given at a greater
intensity. Both control and infra-red treated flies were removed from the


TABLE 1
CONTROL DATA: NO X-RAY TREATMENT

TREATMENT              TESTS   LETHALS   SEMI-LETHALS   VISIBLES   TOTALS   %   P

4 hrs. IR plus 8°C.    588   1   0   1   0.17
8°C. alone             248   0   0.00
4 hrs. IR plus 8°C.    339   2   2   0.59
8°C. alone             313   :   0.64
4 hrs. IR plus 8°C.    566   0.00
8°C. alone             537   0.00

P: 0.5-0.6

0.625


TABLE 2
SUMMARY OF DATA: COMBINED X-RAY AND INFRA-RED TREATMENTS

GROUP NO.   TREATMENT                TESTS   LETHALS   SEMI-LETHALS   VISIBLES   TOTALS

1           2500 r                   123   23   }   1   31
            4 hrs. IR plus 2500 r    479   23   10   1   34
2           2500 r                   453   15   b   3   21
            4 hrs. IR plus 2500 r    472   31   5   1)
3           2500 r                   537   28   .   j   36
            4 hrs. IR plus 2500 r    516   20   :   ‘   27
4           2500 r                   168   30   é   )   4]   f
            4 hrs. IR plus 2500 r    515   39   2   :   4   7.5   0.90
5           2500 r                   522   32
            4 hrs. IR plus 2500 r    522   24   )   34   15   0:8
6           1800 r                   586   29
            5 hrs. IR plus 1800 r    587   16   5   )   25
7           1000 r                   956
            7 hrs. IR plus 1000 r    936   37

Lethals not retested


water bath and x-rayed at the same time. All x-ray exposures were made
with a General Electric industrial machine, operated at 80 kv. and 10 ma.¹⁰
The dosages for both types of radiation varied between experiments and
are given with the data.

Tests for induced mutations were carried out with the Muller-5 stock.¹¹
Any vial in which three or more wild type males could be seen
simultaneously was classed as non-mutant; that is, no recessive lethal or visible
mutation had been produced in the chromosome tested. All vials having less






















than three wild males, or whose males exhibited a visible mutation, were 
retained. The field of the microscope showed a maximum of thirty flies, 
and the presence of three wild males, indicating an excess of 10 per cent 
wild males, was considered not to be a lethal or semi-lethal mutation. All 
visible and lethal mutants were retested. Any case giving a total pro- 
portion of more than 10 per cent of wild males in not less than one hun- 
dred counted flies in the two generations was considered to be normal. 
Those counts giving under 10 per cent and over 0 per cent were scored as 
semilethals. In all cases, control and experimental series were handled 
simultaneously, with the vials randomized in such a way that the scorer 
was unaware whether he was examining experimental or control vials. 
Crossover tests were made of a selected group of lethals, using the y ct® 
ras* { stock. Comparison of data from different series was made by means 
of the Chi-square test. 
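The scoring rules just described can be written out as a small decision procedure. The sketch below is an illustration only: the function names are hypothetical, and two points the text leaves open are settled by assumption, namely that a retest with fewer than one hundred counted flies is reported as needing a larger count, and that a proportion of exactly 10 per cent is scored as semilethal because it is not "more than 10 per cent".

```python
def first_screen(wild_males_seen, visible_mutation=False):
    """First inspection of a vial under the microscope.

    A vial with three or more wild type males in view is non-mutant unless
    its males show a visible mutation; everything else is retained for retest.
    """
    if visible_mutation or wild_males_seen < 3:
        return "retain for retest"
    return "non-mutant"


def classify_retest(wild_males, total_flies, min_count=100):
    """Classify a retained chromosome from counts pooled over two generations."""
    if wild_males < 0 or wild_males > total_flies:
        raise ValueError("wild_males must lie between 0 and total_flies")
    if total_flies < min_count:
        # Assumption: the text requires "not less than one hundred counted flies".
        return "insufficient count"
    if wild_males == 0:
        return "lethal"
    # Integer comparison avoids floating point error at the 10 per cent boundary.
    if 10 * wild_males > total_flies:
        return "normal"
    return "semilethal"


if __name__ == "__main__":
    print(first_screen(3))            # three wild males in a field of at most thirty
    print(classify_retest(0, 120))
    print(classify_retest(5, 120))
    print(classify_retest(30, 120))
```

Three wild males in a field of at most thirty flies corresponds to the same 10 per cent threshold that is applied to the pooled retest counts.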

Table 1 presents the data derived from infra-red and cold treatments.
It is evident that cold shocks alone, or in combination with infra-red
treatments, are not capable of increasing the frequency of sex-linked recessive

TABLE 3
COMBINED ANALYSIS OF THE DATA PRESENTED IN TABLE 2

GROUP                                                   TOTAL TESTS   TOTAL MUTANTS   MUTATION %

All controls (2500 r)                                   2408          165             6.9
All combined treatments (4 hrs. IR plus 2500 r)         2504          179
Both groups combined (line 1 plus line 2)               4912          344


lethals or visibles. Chi-square tests of the combined six series gave a P
value of 0.1-0.2. The average mutation rate for all groups combined
(0.19%) did not differ from that obtained by Kaufmann and Gay² with
18 hours of infra-red treatment in the Swedish-b stock (0.17%).

Table 2 presents the results of runs in which combined infra-red and
x-radiation was given. Groups 1 through 5 are replications of one treatment.
Groups 6 and 7 are modifications of this treatment. Only in group 2 was
there an apparent difference in the frequency of lethals recovered. In this
case, the data suggested that pretreatment definitely increased the
recessive lethal mutation rate. However, as table 3 establishes, the difference
observed is best considered to be simply a chance deviation from the mean.
This conclusion is based on the Chi-square values obtained by: (a) lumping
all control treatments, (b) lumping all experimental treatments, and
(c) lumping all ten series. In each case the P value is of the order of 0.20,
well above the level of significance. Therefore, it must be concluded that
pretreatment with infra-red, under these conditions, does not give a
significant increase in the number of x-ray induced sex-linked lethal and







visible mutations obtained. The data from groups 6 and 7 indicate that 
this conclusion is valid at lower x-ray doses. 
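As an illustration of the kind of comparison involved, the pooled totals of table 3 (165 mutants in 2408 control tests, 179 mutants in 2504 combined-treatment tests) can be put through an ordinary 2 x 2 Chi-square test. This is a sketch under stated assumptions and not necessarily the computation the authors performed: their P values of about 0.20 refer to the series lumped within and across treatments, whereas the hypothetical function below compares only the two pooled totals, without a continuity correction.

```python
import math


def chi_square_2x2(mutants_a, tests_a, mutants_b, tests_b):
    """Pearson Chi-square (1 degree of freedom, no continuity correction)
    comparing two mutation frequencies; returns (chi2, p_value)."""
    a, b = mutants_a, tests_a - mutants_a   # group A: mutant, non-mutant
    c, d = mutants_b, tests_b - mutants_b   # group B: mutant, non-mutant
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper tail of the Chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    controls = (165, 2408)   # all controls, 2500 r
    combined = (179, 2504)   # all combined treatments, 4 hrs. IR plus 2500 r
    for label, (m, t) in (("controls", controls), ("IR + x-ray", combined)):
        print(f"{label}: {100 * m / t:.1f} per cent mutants")
    chi2, p = chi_square_2x2(*controls, *combined)
    print(f"chi-square = {chi2:.3f}, P = {p:.2f}")
```

On these totals the two frequencies differ by only a fraction of one per cent and the test gives a P value far above any conventional level of significance, in line with the conclusion drawn in the text.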

A group of lethals was selected from the combined treatment series 
(group 5, table 2) to be tested by crossing over studies, as it was felt that 
the effect of the infra-red might be obscured by the presence of more than 
one lethal on the X-chromosome. Table 4 presents the data derived from
these tests. Since only one stock out of the 40 tested showed the presence 
of more than one lethal, it must be concluded that the combined treatment 
did not produce a significantly greater number of chromosomes with more 
than one lethal than would be expected. 

Discussion.—The experiments presented in this paper were undertaken
to determine whether an increase in recessive lethals can be obtained with
infra-red pretreatment when the modifying factors are suitably controlled.
Data presented by Swanson and Yost⁶ have shown that the infra-red effect
is dependent upon the temperature at which the infra-red is given, with
low temperature enhancing the effect. However, the data presented here


TABLE 4
DISTRIBUTION OF LETHALS

TYPE                          TOTALS

Single lethals                33
Lethals with inversions        4
Lethals with deficiencies      2
Double lethals                 1

TOTAL                         40


show quite clearly that lowering the temperature does not result in an
increase in recessive lethals. When this was discovered, it was thought
that perhaps the effect was being masked by the induction of double
lethals. This would be likely to happen if the effect of the infra-red were
dependent upon the summation of infra-red and x-ray energy. Only the
chromosome which received x-ray energy could be affected, and that one
might be severely damaged. To test this possibility, some of the lethals
derived from the combined radiation series were tested for the presence of
more than one lethal by crossing over studies. As can be seen in table 4,
the results were negative. It is true that the markers are far enough apart
to permit two closely linked lethals to appear as one in some cases. It
may be assumed, however, that in most cases any two lethals would be far
enough apart to be detected by the technic used, so long as the induction
is random. It is possible to postulate that double lethals arising from
combined treatments may be non-random; that is, a particular x-ray track
might produce two closely placed lethals in a pretreated chromosome.
Any lethals which lie closer together than five map units would go
undetected by crossover studies. Such an hypothesis is essentially untestable,
however, and should be disregarded. From the foregoing, it is evident
that there is no increase in sex-linked recessive lethals when infra-red is
used as a pretreatment to x-radiation. These results are in accord with
those obtained by Kaufmann and Gay.² Similarly, there is no increase in
visible mutations.

In discussing their data, Kaufmann and Gay suggest that there is little
role of position effect in the production of recessive lethals. This follows
from the finding that there is an increase in gross chromosomal
rearrangement with infra-red pretreatment without an increase in recessive lethals.
Such findings are in agreement with the hypothesis of Lea and Catcheside⁴
on the origin of recessive lethals. On the other hand, Kaufmann and Gay
state that, on the basis of this hypothesis, one would expect a decrease of
the order of 5 per cent in recessive lethals with supplementary treatment,
and their data indicate such a decrease. The data presented here do not
show any such decrease. If there is any difference (and there is none
shown by statistical methods), it is in the direction of a slight over-all
increase. Nevertheless, since it is evident that there is no induced increase
of recessive lethals, and since it has been shown that gross chromosomal
aberrations are increased,¹ it is possible to state that there is little apparent
relationship between the two.

Since the publication of the earlier work on Drosophila,’ evidence has
been presented which indicates that in Tradescantia the action of infra-red
is to increase the number of primary breaks, not to modify the method of
recombination.⁶ Kaufmann and Gay start with the assumption that
the recessive lethals induced in Drosophila originate with chromosomal
breakage in large part. Having made such an assumption it is necessary
to assume further that the action of infra-red is to modify the recombination
process and not to increase the number of primary breaks; for if there were
an increase in breakage per se, it would result in an increase in the number
of recessive lethals. The basic assumption about the origin of lethals is
not necessary, however. In fact, one might turn this argument around
and assume that if there is no increase in recessive lethals with infra-red
pretreatment, there is no causal relationship between breakage and the
formation of recessive lethals. Without this latter argument, it is necessary
to propose two quite different modes of action of the infra-red in
Tradescantia and Drosophila. Such an hypothesis seems extreme even though
it is clear that extrapolation from one form to another is dangerous with
distantly related types. Furthermore, there is some evidence from natural
populations of Drosophila which bears on this problem; that is, the fact
that recessive lethals have a high natural mutation rate compared to other
types of mutations. If we are to assume that chromosome breakage is
necessary for the induction of a large portion of recessive lethals, we should







find a high degree of correlation between recessive lethals and chromosomal 
rearrangement in natural populations. Ives,¹² in a survey of one local
wild population, has shown that while 7.5 per cent of 159 lethal chromo- 
somes carried inversions, 116 non-lethal chromosomes from the same pop- 
ulation had an inversion frequency of 8.6 per cent. Clearly there is no 
relation in those data between chromosomal breakage and recessive lethal 
mutations. Further studies of this kind (by Levine and Ives) are in prog- 
ress in this laboratory, utilizing artificial cage populations of many thou- 
sands of individuals related to Oregon-R and to the local wild population. 

The question which arises in this connection is whether we can logically
assume that there is a higher frequency of chromosome breakage than of
point gene mutation. (Point gene mutation is here interpreted as a
chemical change not involving the entire chromosome strand.) This seems
unlikely. It has been demonstrated quite clearly that a high proportion of
x-ray induced recessive lethals occur near breakage points in the
chromosome;* but this is not evidence that there is a causal relationship. It is
perfectly evident that the breaks give us visible proof that some change
has taken place; but the order of magnitude is so great in such changes
that it is not unlikely that mutations should take place in the same
vicinity. If we consider the mutation process in the light of the model proposed
by McElroy and Swanson,¹³ it is clear that the rate determining factor
is the energy of activation of the chemical change which results in
mutation. Obviously different processes will have different energies of
activation. Unless we assume that all mutation is accompanied by chromosomal
breakage (in the usual sense), we can see that when the x-ray provides the
energy necessary to accomplish chromosome breakage, it must at the same
time provide energy which surpasses the energy of activation of other
mutation processes. The question which arises at this point is why the
combined infra-red and x-ray energy does not promote mutation. There are
two factors to be considered regarding this problem. Since the absorption
of the energy of each radiation is separate in time, and since neither
provides sufficient energy singly to cause chromosome breakage (here
considering the ‘increased’ portion of the breakage), it is possible that the total
energy incorporated in the chromosome at any one time is not sufficient to
cause both chromosome breakage and mutation. Secondly, the infra-red
effect depends upon the absorption of specific wave-lengths of infra-red,
and consequently only certain regions of the chromosome will be
sensitized by pretreatment. When the infra-red energy is absorbed, these
regions are effectively weakened, or made more susceptible to breakage
by x-ray energy. With the addition of energy derived from x-radiation,
these regions, predisposed to breakage, undergo disunion. The
pretreatment with infra-red has determined both the quantity and the quality of
the reaction in this manner; there is no excess energy to be utilized by
related structures for mutation to any appreciable extent. Regardless of
the mechanism, it seems permissible to conclude from the data presented
that the recessive lethal is not dependent upon chromosomal breakage.
Such conclusion is borne out by the lack of effect of infra-red pretreatment
on the induction of visible mutations.

Summary.—Pretreatment with infra-red radiation does not increase the
number of sex-linked recessive lethal or visible mutations induced by
x-radiation, even though the environmental factors known to influence the
action of infra-red are favorable. Crossing over studies of the lethals failed
to indicate any significant portion of the chromosomes with two or more
lethals resulting from the combined radiations. It is concluded from these
and other findings and from theoretical considerations that recessive lethal
and visible mutations do not result from chromosome breakage and that
neither type of mutation is to be expected as a result of infra-red radiation
acting alone or in pretreatment before x-radiation.


* Present address: Department of Biology, The Johns Hopkins University, Baltimore,
Md.

† The organization and scoring of the experiments was done under the supervision of
Ives. The radiation was carried out by Yost. The routine of the experiments was
shared by all authors but carried chiefly by Young. The manuscript was prepared
chiefly by Young and Yost and is adapted in part from Young’s thesis for the degree of
Master of Arts at Amherst College (1952).

¹ Kaufmann, B. P., Hollaender, A., and Gay, H., Genetics, 31, 349-367 (1946).

² Kaufmann, B. P., and Gay, H., these PROCEEDINGS, 33, 366-372 (1947).

³ Demerec, M., Cytologia, Fujii Jubilee Volume, 1937, pp. 1125-1132.

⁴ Lea, D. E., and Catcheside, D. G., J. Genetics, 47, 10-24 (1946).

⁵ Herskowitz, I. H., Ibid., 36, 356-363 (1951).

⁶ Swanson, C. P., and Yost, H. T., these PROCEEDINGS, 37, 796-802 (1951).

⁷ Yost, H. T., Genetics, 37, 457-468 (1952).

⁸ Ives, P. T., Ibid., 30, 167-196 (1945).

⁹ Ives, P. T., Evolution, 4, 236-252 (1950).

¹⁰ The authors are indebted to Profs. Carl Kaiser and Karl Hendrickson of the
University of Massachusetts, who made the x-radiation facilities available for this study.

¹¹ Spencer, W. P., and Stern, C., Genetics, 33, 43-74 (1948).

¹² Ives, P. T., Evolution, 1, 42-47 (1947).

¹³ McElroy, W. D., and Swanson, C. P., Quart. Rev. Biol., 26, 348-363 (1951).





MATHEMATICS: D. GALE Proc. N. A. S 


A THEORY OF N-PERSON GAMES WITH PERFECT 
INFORMATION* 


By DAVID GALE
BROWN UNIVERSITY
Communicated by J. von Neumann, April 17, 1953


1. Introduction.—The theory of the general n-person game, in contrast 
to that of the zero-sum two-person game, remains in an unsettled state. 
The chief problem seems to be that of determining the proper definition 
of a solution for such games. The efforts in this direction divide them- 
selves into two groups, the cooperative theory in which the players are 
expected to form coalitions, and the non-cooperative in which such coali- 
tions are forbidden. The first group includes the theory of von Neumann 
and Morgenstern¹ and the more recent work of Shapley,² the second the
equilibrium point theory of Nash.³ The cooperative theories have the
shortcoming that the notion of solution does not give much insight into 
how the games should be played. The non-cooperative theory is more 
specific in this regard, but unfortunately the definition of solution only 
applies to particular games and as yet no way has been found to determine 
a priori whether a given game is solvable in the non-cooperative sense.
Certainly many interesting games are not solvable by this definition. 

The theory to be outlined below gives a very specific notion of solution 
for a special but important class of n-person games, namely, those with 
perfect information. The theory is non-cooperative in the sense described 
above. The most general theorem in the literature concerning such games 
is due to Kuhn⁴ and states that all such games possess an equilibrium
point among their pure strategies. This equilibrium point will not in
general be unique, nor will such games be solvable in the sense of Nash.
We therefore approach the problem from a different point of view. Our
definition of solution leads to a particular set of strategies which form an
equilibrium point, but these will generally be mixed rather than pure
strategies.

2. Some Examples.—An example will be useful in motivating the
subsequent definitions.⁵ An integer between 1 and 9 is to be chosen at
random by an umpire. Three players are to choose in turn distinct
numbers from this range, and the person guessing closest to the chosen
number receives one dollar. In case of a tie each player wins a half dollar.
A very naive first player might guess 5 initially. A little further thought
reveals, however, that the second and third players would be expected to
choose numbers 4 and 6 so that the first player's expectation would be
1/9 as compared to 4/9 for players 2 and 3. A "correct" way to play
the game would be for player I to choose either 3 or 7, player II, 7 or 3,
and player III any remaining number. A rough justification for these
choices is the following. If players I and II have already chosen, the
choice for player III is clear, he will choose the number which maximizes
his expectation. If there are several numbers which do this, we assume
he is equally likely to pick any one of them. This amounts to the assumption
that player III is only interested in maximizing his own expectation
and is not concerned at all with the expectations of the others. Now
assume player I has chosen. Player II tentatively considers each possible
choice under the assumption that player III on the next move will play in
accordance with the principles already described. He makes that choice
which leaves him with the largest expectation under these assumptions.
Now consider the initial choice for player I. For each possible choice
he can compute how players II and III will choose in accordance with
the principles described above. Thus, in each case he computes his
expectation and plays so as to maximize it. Again, if there are several
choices which maximize, he chooses among them by a random device.
The expectations for players I, II, and III if they play as described will
be 7/18, 7/18, and 4/18.

The theory to be outlined here makes precise the notions described 
above for any game with perfect information and provides a definition of 
values and optimal strategies for such games. 
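
The reasoning of the three-player example can be checked mechanically. The following is only a sketch of that backward argument, not the author's own procedure: it assumes the umpire's integer is uniform on 1 to 9, that a tie splits the dollar equally, and that a player who is indifferent among several best choices picks each with equal probability. The names `payoffs` and `solve` are illustrative.

```python
from fractions import Fraction

NUMBERS = range(1, 10)  # the umpire draws uniformly from 1..9 (assumption stated above)


def payoffs(choices):
    """Expected winnings of each player once all choices are made."""
    totals = [Fraction(0)] * len(choices)
    for target in NUMBERS:
        best = min(abs(c - target) for c in choices)
        winners = [i for i, c in enumerate(choices) if abs(c - target) == best]
        for i in winners:
            # A tie splits the dollar equally between the tied players.
            totals[i] += Fraction(1, len(winners))
    return tuple(t / len(NUMBERS) for t in totals)


def solve(chosen=(), n_players=3):
    """Return (expectation vector, optimal moves of the player about to move)."""
    if len(chosen) == n_players:
        return payoffs(chosen), []
    mover = len(chosen)
    outcomes = {m: solve(chosen + (m,), n_players)[0]
                for m in NUMBERS if m not in chosen}
    best = max(v[mover] for v in outcomes.values())
    optimal = [m for m, v in outcomes.items() if v[mover] == best]
    # An indifferent player is assumed to randomize uniformly over optimal moves.
    average = tuple(sum(outcomes[m][i] for m in optimal) / len(optimal)
                    for i in range(n_players))
    return average, optimal


values, first_moves = solve()
print(first_moves, values)
```

Calling `solve((3,))` or `solve((5,))` gives the continuation after a fixed first choice, which is how the naive opening 5 can be compared with the openings 3 and 7.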

The reader may wish to amuse himself by solving in the sense of the 
above example the following continuous version. Three players choose 
in turn a real number on the unit interval. A player is said to "control"
the number t ∈ [0, 1] if t is closer to his choice than to that of the other
players. At the end of the game each player is paid an amount equal to 
the length of the interval he controls. 

3. Definitions and Notation.—We shall consider here only finite games, 
and for the sake of clarity and keeping notation under control, we shall 
assume that the games involved do not have chance moves although all 
our results carry over to this case. 

The notations used here are essentially those introduced in an earlier
paper.⁶ We summarize them briefly.

Definition: An n-person game $\Gamma$ with perfect information consists of:

(1) A set $X$ called the set of positions of $\Gamma$.

(2) A point $x_0 \in X$ called the initial position.

(3) A function $\phi$ called the predecessor function which maps $X - x_0$
into $X$ and has the property: (A) There is an integer $N$ such that $\phi^N(X) = x_0$.
(Intuitively, the positions which can be reached on the next move
from position $x$ are the positions $\phi^{-1}(x)$.) The set of positions $x$ such
that $\phi^{-1}(x) = \emptyset$ is called the set of final positions and is denoted by $X_F$.

(4) A partition of $X - X_F$ into sets $(X_1, X_2, \ldots, X_n)$. (The position
$x \in X_i$ if the $i$th player is to move at position $x$.)







(5) $n$ real valued functions $P_1, P_2, \ldots, P_n$ defined on the set $X_F$. $P_i$
is called the $i$th player's pay-off function.

The above five objects constitute the game $\Gamma$. It remains to describe
how the game is to be played. The intuitive notion is the following:
Suppose $x_0 \in X_i$. Then player $i$ chooses a position $x_1 \in \phi^{-1}(x_0)$. Now
suppose $x_1 \in X_j$. Then player $j$ chooses a position $x_2 \in \phi^{-1}(x_1)$, etc.
Play continues in this way until a position $x \in X_F$ has been chosen, which
will always occur because of property (A). At this point the play has
terminated and player $i$ receives the amount $P_i(x)$, $i = 1, \ldots, n$.

As is customary in game theory (but not in practice!), we assume that 
each player has chosen his “‘strategy’’ in advance, that is, he has made up 
his mind as to what choice he will make at all conceivable positions. The 
following definitions make these notions precise. 

Definitions: A play of $\Gamma$ is a sequence $(x_0, x_1, x_2, \ldots, x_k)$ where
$\phi(x_i) = x_{i-1}$, and $x_k \in X_F$.

A pure strategy $\sigma_i$ for player $i$ is a function from $X_i$ into $X$ such that for
each $x \in X_i$, $\sigma_i(x) \in \phi^{-1}(x)$.

Given an n-tuple of pure strategies $(\sigma_1, \sigma_2, \ldots, \sigma_n)$ we define inductively
a play $(x_0, x_1, \ldots, x_k)$ as follows: $x_0$ is the initial position. Suppose $x_j$ has
been defined and $x_j \in X_i$. Then $x_{j+1} = \sigma_i(x_j)$. It follows from the
properties of the $\sigma_i$ that the sequence so defined is a play, which we call
the play determined by $(\sigma_1, \sigma_2, \ldots, \sigma_n)$ and denote by the symbol
$\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle$. Finally, if $x_k$ is the final position of the play
$\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle$ we define $\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_i = P_i(x_k)$, $i = 1, 2, \ldots, n$. Thus,
$\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_i$ is the pay-off to player $i$ if the strategies $(\sigma_1, \sigma_2, \ldots, \sigma_n)$ are
chosen by the n-players. 
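
As a small illustration of these definitions, the sketch below stores a game by its predecessor function and traces the play determined by a tuple of pure strategies. The toy game, the dictionary layout, and the names `PRED`, `MOVER`, `PAYOFF`, `successors` and `play` are all assumptions made for the example and do not come from the paper.

```python
def successors(pred):
    """Invert the predecessor function: position -> positions reachable next."""
    succ = {}
    for x, parent in pred.items():
        succ.setdefault(parent, []).append(x)
    return succ


def play(x0, pred, mover, strategies):
    """Trace the play determined by one pure strategy per player."""
    succ = successors(pred)
    path = [x0]
    while path[-1] in succ:  # a position with no successor is final
        x = path[-1]
        nxt = strategies[mover[x]][x]
        if pred.get(nxt) != x:
            raise ValueError(f"{nxt!r} cannot be reached from {x!r}")
        path.append(nxt)
    return path


# Hypothetical two-player game: player 1 moves at x0, player 2 at a and b.
PRED = {"a": "x0", "b": "x0", "a1": "a", "a2": "a", "b1": "b", "b2": "b"}
MOVER = {"x0": 1, "a": 2, "b": 2}
PAYOFF = {"a1": (3, 0), "a2": (0, 2), "b1": (1, 1), "b2": (2, 3)}

sigma = {1: {"x0": "a"}, 2: {"a": "a2", "b": "b1"}}
path = play("x0", PRED, MOVER, sigma)
outcome = PAYOFF[path[-1]]
print(path, outcome)
```

Only the predecessor map is stored; the successor sets are derived from it, which mirrors the choice of the predecessor function as the primitive object.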

(There is a standard geometric picture of $\Gamma$. The positions are represented
by points in a plane. If $x = \phi(x')$ a directed line segment is drawn
from $x$ to $x'$. One can arrange the points $x$ so that these lines do not
intersect, thus the figure becomes a topological tree in the plane, and each
play is represented by a path of segments running from the initial position
$x_0$ to some final position. Because of its intuitive appeal, some authors
have used the topological tree as a primitive concept in defining a game.
From the mathematical point of view there seems little purpose in introducing
these geometric irrelevancies into the theory. Further, the predecessor
function seems easier to work with. The tree picture may
nevertheless be useful in helping one to visualize many of the properties
of games.)

For the work that follows it is convenient to make a small modification
of the concept of pure strategy. In general each time a player makes a
move he precludes the possibility of certain positions occurring. It would
clearly be a waste of time for him to plan his strategy for positions which
he knows cannot occur. Following Kuhn we call the position $x$ irrelevant
for the strategy $\sigma_i$ if $x$ belongs to no play $\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle$. Otherwise
$x$ is called relevant, and a subset $X' \subset X$ is called relevant for $\sigma_i$ if
all its positions are relevant. Our modification consists in requiring that
each strategy $\sigma_i$ be defined only on a maximal relevant subset of $X_i$,
relative to $\sigma_i$. This condition effects a reduction in the number of pure
strategies.

Finally we recall the definition of an n-person game in normal form.
Such a game consists of $n$ sets $\Sigma_1, \Sigma_2, \ldots, \Sigma_n$ whose elements are called
strategies, and $n$ real valued functions $P_1, P_2, \ldots, P_n$ whose domain is the
set $\Sigma_1 \times \Sigma_2 \times \cdots \times \Sigma_n$. If in our game $\Gamma$ we let $\Sigma_i$ denote the set of pure
strategies for player $i$, and define
$$P_i(\sigma_1, \sigma_2, \ldots, \sigma_n) = \langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_i,$$
we see that we have defined a game in normal form corresponding to $\Gamma$
which we shall call the normal form of $\Gamma$.

4. The Theory.—In this section we shall define a new notion of solution 
of a game. Games in general will not possess solutions in our sense. 
However, our fundamental existence theorem states that games with 
perfect information always have such solutions. Our definition of solution 
will determine a unique optimal strategy for each player and a set of values 
or expectations for each player provided all play optimally. 

A further preliminary remark is of interest. In examining the reasoning 
used to justify our solution of the example of section 2, it appears that 
essential use has been made of the extensive form of the game involved. 
In particular the order in which the players moved played a central role 
in the argument. It is somewhat surprising, then, that the same solution 
could have been obtained by examining only the normal form of the game. 
Indeed we shall define our notion of solution from the normal form alone. 
Thus, vastly different games will have the same solutions provided they 
have the same normal form. 

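Definition: The pure strategy $\sigma_1$ dominates $\sigma_1'$, written $\sigma_1 > \sigma_1'$, provided

(1) $\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_1 \geq \langle\sigma_1', \sigma_2, \ldots, \sigma_n\rangle_1$ for all $(\sigma_2, \ldots, \sigma_n)$.

(2) $\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_1 > \langle\sigma_1', \sigma_2, \ldots, \sigma_n\rangle_1$ for some $(\sigma_2, \ldots, \sigma_n)$.

The pure strategies $\sigma_1$ and $\sigma_1'$ are equivalent if
$$\langle\sigma_1, \sigma_2, \ldots, \sigma_n\rangle_1 = \langle\sigma_1', \sigma_2, \ldots, \sigma_n\rangle_1 \text{ for all } (\sigma_2, \ldots, \sigma_n).$$

Similar definitions are made for strategies $\sigma_i$, $i = 2, \ldots, n$.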

Our concept of a solution is based on the following two assumptions: 

(I) A player will not use a dominated strategy.

(II) A player will use equivalent strategies with equal probability. 

A word concerning these assumptions. The notion of dominance has 
occurred frequently in the study of games, although the usual definition 
is not precisely the one given here. The second assumption asserts the 







indifference of each player concerning the fate of the others. It might 
be argued that this condition is unnatural or too restrictive. For example 
in a two-person game a player might decide, other things being equal, to 
do the most possible damage to his opponent, or, if he were more amiable, 
to help his opponent. Actually we could develop the theory equally well 
by replacing assumption (II) by various other assumptions. The only 
requirement is that we shall have some definite rule which will determine 
the behavior of each player when he is confronted with equivalent strate- 
gies, that is, a rule which makes correspond to any set of equivalent pure 
strategies a probability combination of these strategies. Our indifference 
assumption, (II), is the simplest such rule, but others would serve equally 
well. 

Corresponding to assumptions (I) and (II) we now introduce two
operations on the normal form of $\Gamma$.

Operation $D_1$: Deleting all dominated strategies.

Operation $A_1$: Replacing all maximal sets of equivalent strategies by
a single mixed strategy using each equivalent pure strategy with equal
probability (A ~ averaging).

It will be observed that these two operations, when possible, have the 
effect of reducing the number of strategies for each player. 

We now define a new game $\Gamma^1$ in normal form whose pure strategies are
those obtained from the original sets by operations $D_1$ and $A_1$. The pay-off
functions $P_i^1$ are the original pay-off functions $P_i$ extended in the
usual way to mixed strategies whenever necessary.

We now define operations $D_2$ and $A_2$ to be similar to $D_1$ and $A_1$ but
operating on the game $\Gamma^1$. We thus obtain a game $\Gamma^2$. Since the original
number of pure strategies was finite, it is clear that eventually this process
must terminate, i.e., no further reduction will be possible. 
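
The alternation of deleting and averaging can be sketched for a two-player game given by two pay-off matrices. This is an illustrative reading, not the paper's construction: it assumes dominance means "at least as good against every opposing strategy and strictly better against some", that within each stage the deletion is carried out before the averaging, and that both players are treated simultaneously. The names `reduce_game`, `dominated`, `classes`, `mix` and `transpose` are hypothetical.

```python
from fractions import Fraction


def dominated(T):
    """Own strategies s for which some other strategy is >= everywhere, > somewhere."""
    out = set()
    for s, row in enumerate(T):
        for s2, row2 in enumerate(T):
            if s2 != s and all(a >= b for a, b in zip(row2, row)) \
                    and any(a > b for a, b in zip(row2, row)):
                out.add(s)
                break
    return out


def classes(T):
    """Group own strategies whose own pay-offs agree against every opposing strategy."""
    groups = {}
    for s, row in enumerate(T):
        groups.setdefault(tuple(row), []).append(s)
    return list(groups.values())


def transpose(M):
    return [list(col) for col in zip(*M)]


def mix(strategies, members):
    """Equal-probability mixture of the listed (possibly already mixed) strategies."""
    out = {}
    for m in members:
        for k, p in strategies[m].items():
            out[k] = out.get(k, Fraction(0)) + p / len(members)
    return out


def reduce_game(A, B):
    """A[i][j], B[i][j]: pay-offs to players 1 and 2 for row i and column j."""
    A = [[Fraction(x) for x in row] for row in A]
    B = [[Fraction(x) for x in row] for row in B]
    rows = [{i: Fraction(1)} for i in range(len(A))]
    cols = [{j: Fraction(1)} for j in range(len(A[0]))]
    while True:
        size = (len(rows), len(cols))
        # Deleting step: remove dominated rows and columns of the current game.
        bad_r, bad_c = dominated(A), dominated(transpose(B))
        keep_r = [i for i in range(len(rows)) if i not in bad_r]
        keep_c = [j for j in range(len(cols)) if j not in bad_c]
        A = [[A[i][j] for j in keep_c] for i in keep_r]
        B = [[B[i][j] for j in keep_c] for i in keep_r]
        rows, cols = [rows[i] for i in keep_r], [cols[j] for j in keep_c]
        # Averaging step: merge each class of equivalent strategies into one mixture.
        rc, cc = classes(A), classes(transpose(B))

        def avg(M):
            return [[sum(M[i][j] for i in R for j in C) / (len(R) * len(C))
                     for C in cc] for R in rc]

        A, B = avg(A), avg(B)
        rows, cols = [mix(rows, R) for R in rc], [mix(cols, C) for C in cc]
        if (len(rows), len(cols)) == size:
            return A, B, rows, cols


# Normal form of a toy game: player 1 stops (pay-offs 1, 1) or lets player 2
# choose between outcomes (0, 3) and (2, 3), between which player 2 is indifferent.
A_final, B_final, row_mix, col_mix = reduce_game([[1, 1], [0, 2]], [[1, 1], [3, 3]])
print(A_final, B_final, row_mix, col_mix)
```

In the toy game nothing is dominated at first; the two columns are merged by averaging, after which the two rows become equivalent and are merged in turn, leaving a single strategy for each player.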

FUNDAMENTAL THEOREM. If $\Gamma$ is the normal form of a game with perfect
information, the successive applications of the operations $D_i$ and $A_i$ will
after a finite number of stages reduce $\Gamma$ to a game $\Gamma^k$ having exactly one pure
strategy for each player.

It is now clear how to define the solution of $\Gamma$. Since a pure strategy
for $\Gamma^k$ is a strategy (in general, mixed) for $\Gamma$, we define these unique strategies
as the optimal strategies for $\Gamma$ and the values are, of course, the respective
pay-offs to each player when these strategies are used.


The proof of the fundamental theorem is quite involved due to the rather
complicated structure of games in extensive form. The usual technique
of performing an induction on the length of the game does not seem to
work here. The proof proceeds by showing that at any stage of the
reducing process, if any player has more than one strategy left, then further
reductions are possible.

As remarked earlier, the optimal mixed strategies form an equilibrium
point. This shows then that if $\Gamma$ is two-person zero-sum, our definition
of value agrees with the usual one and our optimal strategies are optimal 
in the accepted sense. The reason we obtain a unique optimal strategy 
is of course due to the averaging process. This is rather unnatural in 
some cases; for instance, if a player knew several optimal strategies for 
chess he would probably use the one which would terminate the game 
most rapidly. Introducing such considerations would, however, complicate 
the theory beyond reason. 

* Written under contract with the National Science Foundation.

1 Neumann, J. von, and Morgenstern, O., The Theory of Games and Economic Behavior,
Princeton University Press, 1947.

2 Shapley, L. S., "A Value for n-Person Games," Contributions to the Theory of
Games—II, Annals of Mathematics Study No. 28, Princeton University Press, 1953.

3 Nash, J. F., "Non-cooperative Games," Ann. Math., 54, 286-295 (1951).

4 Kuhn, H. W., "Extensive Games," Proc. Natl. Acad. Sci., 36, 570-576 (1950).

5 Examples of this type were brought to my attention by Mr. J. D. Ellinwood of
Brown University, and it was from consideration of these examples that I was led to
the investigations reported on herein.

6 Gale, David, and Stewart, F. M., "Infinite Games with Perfect Information,"
Contributions to the Theory of Games—II, Annals of Mathematics Study No. 28,
Princeton University Press, 1953.


THE SPINOR CONNECTION IN THE UNIFIED EINSTEIN 
THEORY OF RELATIVITY 


By VÁCLAV HLAVATÝ


GRADUATE INSTITUTE FOR APPLIED MATHEMATICS, INDIANA UNIVERSITY, BLOOMINGTON, 
INDIANA* 


Communicated by T. Y. Thomas, March 26, 1953 
1. Introduction.—The Einstein unified theory of relativity is based¹
on a real tensor $g_{\lambda\mu}$ which is neither symmetric nor skew-symmetric. We
denote its symmetric (skew-symmetric) part by $h_{\lambda\mu}$ ($k_{\lambda\mu}$),² assume that
$h_{\lambda\mu}$ is of the signature − − − + and regard it as the metric tensor of the
space $X_4$ of the unified theory of relativity. Moreover, we denote by g,
h, k the determinants of $g_{\lambda\mu}$, $h_{\lambda\mu}$ and $k_{\lambda\mu}$, respectively, and assume throughout
this paper
$$gh \neq 0.$$


The structure of $g_{\lambda\mu}$ depends on k as well as on


b ts : 
D (; hk kas) thk. 





502 MATHEMATICS: V. HLAVATÝ Proc. N. A. S.


The tensor $g_{\lambda\mu}$ being real we have only the following possibilities

(a) $D \neq 0$, $k \neq 0$

(b) $D \neq 0$, $k = 0$

(c) $D = 0$, $k = 0$. (1)

Throughout this paper we confine ourselves to the cases (1a) and (1b)
only. The main scope of this paper is to find a connection of the spinor
space associated with $X_4$ in terms of the given data (i.e., $g_{\lambda\mu}$) only. However,
we confine ourselves to results without proofs. A more detailed
study with the proofs will be published later.³

2. Non-holonomic Frame in X4.-Denote by ((Lg")) a diagonal matrix 


whose diagonal elements are 


l 
and denote by Lg” the elements of the inverse matrix to ((Lg”)). More 


over, let us introduce the matrices 


((hin)) = ((h*%)) 


There are four linearly independent vectors Ey, ... determined 
uniquely by g,, up to the transformations 


A Aj B 
(a) 'E,” = Le Fy 
and four linearly independent contravariant null vectors 
.y By 
ky = hapky h 


known up to the transformations 


1 
(b) heat = | a Ep” 


such that 


ALE AB ne a 
han = hy, Ea” Ep", h =h* hy” kb, 
ine B ah ; 
ge ae Te Pa Ef Ex: Sib... (33) 


sine | ae | — a 3 
The vectors $E_{I}^{\nu}$, $E_{II}^{\nu}$ (as well as $E^{I}_{\lambda}$, $E^{II}_{\lambda}$) are complex conjugate, the remaining
vectors are real. The functions $E_{A}^{\nu}$, $E^{A}_{\lambda}$ may be thought of as
intermediate components of the unit tensor which relates the holonomic
components $v^{\nu}$ ($w_{\lambda}$) of a vector and their non-holonomic components


> Xd 
Wz = kp Wy 







. . , B . - 

(so that in particular hy, h*® are non-holonomic components of /y, and 
N i ro — , ° 
h™, respectively). The change of frame (2) changes the non-holonomic 

components according to 
os a A, » 
WR = Lez Wy. (3) 


3. The Local Spinor Space $S_3(P)$.—Let P be a generic point of $X_4$.
Denote by $T_4(P)$ its tangential space at P and by $I_3(P)$ the ideal space of
$T_4(P)$. The components $v^{\nu}(P)$ ($w_{\lambda}(P)$) of a vector in $T_4(P)$ may be looked
upon as homogeneous projective coordinates of a point (plane) in $I_3(P)$.
If in particular
$$h_{\lambda\mu} v^{\lambda} v^{\mu} = 0$$
we say that $v^{\nu}$ is an isotropic point. Every other point will be termed an
anisotropic point. Introduce the matrices associated with the $E_{A}^{\nu}$


E," 


((F,,"(P))) = 


((*Ug*(P))) = (CU,'"'(P))) 


il 














(e = +) 
and the scalars associated with a vector v* (P) 
vo= hy vv" at P 
and put 
(a) v»°(P) ky ,"(P)v*(P) 
(b) ‘v(P) = *Ug(P)v"(P) (4) 


The numbers $v_a{}^b(P)$ may be looked upon as a set of homogeneous components
of a tensor in a projective three-dimensional space which we denote
by $S_3(P)$. Assume first that $v^{\nu}(P)$ is anisotropic. Then the tensor $v_a{}^b(P)$
defines a biaxial involution in $S_3(P)$, whose (mutually skew) axes have the
Plücker point coordinates ${}^{\varepsilon}v^{ab}(P)$ given by (4b). If however, $v^{\nu}(P)$ is
isotropic then the lines (4b) coincide, ${}^{+}v^{ab} = {}^{-}v^{ab} = v^{ab}$, and constitute a linear
hyperbolic line congruence C(P) in $S_3(P)$ as $v^{\nu}(P)$ moves. The axes of
C(P) have the Plücker point coordinates
A” (0 1000 0), 7A” (1 000 0 0). 
The locus of all lines ‘v*’(P) and all lines of C(P) is a linear line complex 
A(P) in S,(P). Its coordinates are 
Ke = 14% + 24%, 


The space S;(P) will be referred to as the local spinor space attached to 
1(P). Every point S*(P) of S;(P) will be referred to as a spinor in S;(P). 
The change of frame of reference in S;(P) is 

‘SCP) = A;*(P) S*(P) (5) 
where the A’s are determined by (2) 


v+iy 
TT ae 


0 


0 








P 








l 
and a 1,7. We denote by A,“(P) the elements of the matrix inverse to 


(6). The line v“’(P) in S;(P) corresponding to an isotropic point in /;(P) 
has the equations 
S*(P)o'(P) + S“(P)vo"(P) = 0 
S'(P)v"(P) + S2(P)o™(P) = 0 (7) 
where S"(P) is a generic not focal point of v’(P). The components v'(P) 
and v''(P) being complex conjugate (while v'(P) and v'Y(P) are real) we 
have from (7) 
S'(P) S3(P) 
S*(P) S'(P) 
or sometimes even 


(b) SUP) = SP) S2(P) = S*(P). (8) 


A spinor $S^a(P)$ satisfying (8a) but not (8b) will be referred to as an analytical
spinor. A spinor satisfying (8b) will be termed a physical spinor.
This distinction is preserved by (6).

4. The Spinor Space $\Sigma_3$.—Every point P of $X_4$ gives rise to a local







spinor space $S_3(P)$, with the congruence C(P) and the complex A(P).
The axes ${}^1A$ and ${}^2A$ have the same Plücker coordinates in every $S_3(P)$.
Therefore we may identify them with two arbitrary skew lines of a three-dimensional
projective space $\Sigma_3$. In doing so we superpose all spaces $S_3(P)$
on $\Sigma_3$ in a manner just mentioned. The space $\Sigma_3$ will be referred to as
the spinor space attached to $X_4$. As the point P moves along $X_4$ the map of
an isotropic field v4 = /,* v* is the set of lines 
v0 O vil ot ptt yt) (9) 
in $\Sigma_3$. Hence these lines belong to the congruence with the axes ${}^1A$, ${}^2A$.
In other words, the congruence C(P) of $S_3(P)$ is identical with the just
mentioned congruence C in $\Sigma_3$.⁵ The same is true for the complex A(P)
of $S_3(P)$. It is identical with the complex ${}^1A^{ab} + {}^2A^{ab}$ in $\Sigma_3$, A(P) = K.

5. Connections of $\Sigma_3$.—The superposition of all $S_3(P)$ on $\Sigma_3$ results in
a set of connections in $\Sigma_3$, which we are going to investigate: Let P and
Q be two points of $X_4$, $S_3(P)$ and $S_3(Q)$ their corresponding local spinor
spaces (both superposed on $\Sigma_3$). Consider a projectivity $P_b{}^a(P|Q)$
which maps a spinor $S^b(P)$ of $S_3(P)$ on a spinor

$$S^a(P|Q) = P_b{}^a(P|Q)\, S^b(P) \qquad (10)$$

of $S_3(Q)$. We assume that this projectivity does not depend on the reference
frame, is not degenerate and reduces to identity for P = Q. Moreover,
we assume that a physical spinor is mapped on a physical spinor by
(11). Using the fact that C = C(P) for all P of $X_4$ we require also that the
axes ${}^1A$ and ${}^2A$ of C be reproduced by (11). Keeping Q fixed and putting


II, : : P,’(P\Q) 
és Ox” ‘ : P=0O 


we obtain in II,,° a connection 
, a 
IT, 


so that in particular 


On” IT. (]2) 


ra IT," { 


is also a connection and I’,,° Q. This connection is given by the matrix 


(b) (( 157) ) 







where $a_\lambda$, $b_\lambda$, $c_\lambda$ are arbitrary vector functions of P and $\bar a_\lambda$, $\bar b_\lambda$, $\bar c_\lambda$ their
complex conjugate.

6. The Connection of >0;.- Let us denote by Z, the symbol of the 
covariant derivative with respect to (12) and assume that it operates in X, 
by means of the Christoffel symbols of /,,. In order to define the vectors 
ay, by, ¢, in a proper way we require that the relationship (4a) be not dis- 
(dx arbitrary). 


carded by the operator dx“ Z, 


(a) £&,F,,’ = 0. (13) 


*- ae ai . . = F . . 
Using now the definition of the matrix /,,’ as well as (126) we obtain from 
(13a) 


(4) Eun’ 2, Fan {- Ex" = <a 


-Ey* =, Ey! = Ew = 


> hm TF ‘hem op IV 
Eun’ 7 fk) = ky = J = = (1.3) 
Interchanging the indices I and II in (13b) we obtain $\bar a_\lambda$, $\bar b_\lambda$, $\bar c_\lambda$. Substituting
these values into (12b) we obtain a uniquely defined connection $\Gamma_{\lambda b}{}^{a}$ of
$\Sigma_3$, for which (13a) holds. It is to be remarked that


—_ 1Aab ™ 2Aab = i = 
=, 1A =A = EK” = 0 


as it was to be expected. 


* Prepared under Army Contract DA 33-008 OR D-467.

1 Einstein, A., The Meaning of Relativity, Appendix II, 1950.

2 The Greek indices run from 1 to 4, the small Latin indices from 1 to 4, the capital
Latin indices from I to IV. Moreover, the index $\mathfrak{a}$ in section 3 runs from 1 to 6.

3 The proofs of the statements of sections 2, 3, 5 are based on the results of the following
papers:

Section 2: Hlavatý, V., "The Elementary Basic Principles of the Unified Theory of
Relativity. A," J. Rat. Mech. Anal., 1, 539-562 (1952). The statement about the
structure of $g_{\lambda\mu}$ is taken from this paper.

Section 3: Hlavatý, V., "Spinor Space and Line Geometry," Can. J. Math., 3, No. 4,
442-459 (1951) and "Spinor Space and Line Geometry II," J. Rat. Mech. Anal., 1, 321-339
(1952).

Section 5: Bortolotti, E., and Hlavatý, V., "Contributi alla teoria delle connessioni,"
Annali di Matematica, 15, Series 4, 1-71 (1936-37).

A spinor connection based on projective theory of relativity is dealt with in Veblen,
O., and Taub, A. H., "Projective Differentiation of Spinors," Proc. Natl. Acad. Sci., 20,
85-92 (1934).

4 The superscript a (the subscript b) in the first matrix denotes the rows (columns).
In the second matrix the columns are labeled by the superscripts ab which take the values

12 34 23 14 31 24

in this order.

5 The lines (9) in $\Sigma_3$ belonging to an isotropic field $v^{\nu}$ are the rulings of the congruence
C. The same congruence regarded as C(P) in $S_3(P)$ is constituted by the rulings which
map on $S_3(P)$ all isotropic points $v^{\nu}(P)$ for P fixed.







CONNECTIONS BETWEEN EINSTEIN'S TWO UNIFIED 
THEORIES OF RELATIVITY 
By VÁCLAV HLAVATÝ


GRADUATE INSTITUTE FOR APPLIED MATHEMATICS, INDIANA UNIVERSITY, BLOOMINGTON, 
INDIANA* 


Communicated by T. Y. Thomas, March 26, 1953 
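1. The Problem to Be Solved.—In his three earlier papers¹ Einstein has
based his unified theory on a complex tensor
$$\text{(a)}\quad {}^*g_{\lambda\mu} = h_{\lambda\mu} + i k_{\lambda\mu}, \qquad h_{\lambda\mu} = h_{(\lambda\mu)}, \quad k_{\lambda\mu} = k_{[\lambda\mu]} \qquad (1)$$
and on the complex connection ${}^*\Gamma^{\nu}_{\lambda\mu}$, which satisfies the field equations
$$\text{(a)}\quad \partial_\omega\, {}^*g_{\lambda\mu} = {}^*\Gamma^{\alpha}_{\lambda\omega}\, {}^*g_{\alpha\mu} + {}^*\Gamma^{\alpha}_{\omega\mu}\, {}^*g_{\lambda\alpha}.$$
The tensor ${}^*g_{\lambda\mu}$ being obviously hermitian symmetric
$$\text{(a)}\quad {}^*g_{\lambda\mu} = \overline{{}^*g_{\mu\lambda}}$$
one assumes also the hermitian symmetry of the connection
$$\text{(b)}\quad {}^*\Gamma^{\nu}_{\lambda\mu} = \overline{{}^*\Gamma^{\nu}_{\mu\lambda}}. \qquad (3)$$
In his recent book² Einstein based his unified theory on a real tensor
$$\text{(b)}\quad g_{\lambda\mu} = h_{\lambda\mu} + k_{\lambda\mu}, \qquad h_{\lambda\mu} = h_{(\lambda\mu)}, \quad k_{\lambda\mu} = k_{[\lambda\mu]}$$
and on a real connection $\Gamma^{\nu}_{\lambda\mu}$ which satisfies the field equations
$$\text{(b)}\quad \partial_\omega g_{\lambda\mu} = \Gamma^{\alpha}_{\lambda\omega}\, g_{\alpha\mu} + \Gamma^{\alpha}_{\omega\mu}\, g_{\lambda\alpha}. \qquad (2)$$
Let us identify the tensors $h_{\lambda\mu}$ in (1a, b) as well as the tensors $k_{\lambda\mu}$ in (1a, b).
The main scope of this paper is to find the relationship between ${}^*\Gamma^{\nu}_{\lambda\mu}$ given
by (2a) and $\Gamma^{\nu}_{\lambda\mu}$ given by (2b).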
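2. First Step.—Let us put
$${}^*\Gamma^{\nu}_{\lambda\mu} = {}^*\Lambda^{\nu}_{\lambda\mu} + i\, {}^*S_{\lambda\mu}{}^{\nu} \qquad (4)$$
where ${}^*\Lambda$ and ${}^*S$ are real. The condition (3b) yields
$${}^*\Lambda^{\nu}_{\lambda\mu} = {}^*\Lambda^{\nu}_{(\lambda\mu)}, \qquad {}^*S_{\lambda\mu}{}^{\nu} = {}^*S_{[\lambda\mu]}{}^{\nu}. \qquad (5)$$
The underlying space being real and referred to real coordinates we see at
once from the transformation rule of the ${}^*\Gamma$'s as well as from (5) that
${}^*\Lambda^{\nu}_{\lambda\mu}$ is a (symmetric) connection while ${}^*S_{\lambda\mu}{}^{\nu}$ is a skew-symmetric tensor
and in particular
$$\text{(a)}\quad {}^*\Gamma^{\nu}_{[\lambda\mu]} = i\, {}^*S_{\lambda\mu}{}^{\nu}. \qquad (6)$$
In the next section we will compute ${}^*\Lambda^{\nu}_{\lambda\mu}$ as well as ${}^*S_{\lambda\mu}{}^{\nu}$.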


, 


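3. Covariant Derivative ${}^*D_\omega$.—If we denote by ${}^*D_\omega$ the symbol of the
covariant derivative with respect to the symmetric connection ${}^*\Lambda^{\nu}_{\lambda\mu}$, then
(2a) splits in two real sets namely
$$\text{(a)}\quad {}^*D_\omega h_{\lambda\mu} = -2\, {}^*S_{\omega(\lambda}{}^{\alpha}\, k_{\mu)\alpha}$$
$$\text{(b)}\quad {}^*D_\omega k_{\lambda\mu} = -2\, {}^*S_{\omega[\lambda}{}^{\alpha}\, h_{\mu]\alpha} \qquad (7)$$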


Introducing the symbol $\nabla_\omega$ of the covariant derivative with respect to the
Christoffel symbols $\{{}^{\nu}_{\lambda\mu}\}$ of $h_{\lambda\mu}$³ and putting


* a fvl xyTY , 
Ay, ia’ tau - l Au (8) 
we may rewrite the equations (7) as follows 


ee . se 
(a) Ug Bag HOU gg Ing = 2 *Soa*h 


(b)  Vkny = *U% 4 Ray + *U 


w ap 


wa 
ai Bane TS Sat, Wile » 
Let us denote by (uAw) the equation (9a). If we compute 

(uAw) + (Awu) — (wera) 
we obtain 
Pk 


(a2) Wy, «= —2h* *S,c%hy 


a 


On the other hand if we introduce the tensor 


K ou = Vo hyn + Vy ke + VLR 


wer 
we have from (10) and (9b)4 
2 *¢ , B eG bP La hee 
? no | es - 1k. Sy eye, 1Ri., ky S,gy° 
If we put 
* ying [Een] sf op. Inst] BS 1 ogll ni Bb 
Acs 6}4,6),) 5) | Rie 61) 2, + 26 > Rte ky) 


then we may write (lla) in a more condensed form 


| 


xo * ying . ’ ) 
(a) Stor "Xoee = 5 Kow- (12) 


~ 


4. The System (2b).—The system (2b) has been solved for the $\Gamma$'s in a
previous paper.⁵ In particular we have obtained the solution in the form


fiet Th Sins TF Oy (13) 


where 







(b) U"’, 


— sShLva © SB, 
a = 2h Say k,p 


(b) > a Tiny): 


“ 


The tensor S,,, satisfies the equations 


(b) Suc a 


Eng wr 


} 


st] 2 3 oplé 
aia 2k," 5) k ‘ 26° ky." k P * 


v u| 
Denote by g, h, k the determinants of $g_{\lambda\mu}$, $h_{\lambda\mu}$ and $k_{\lambda\mu}$, respectively. (We
already assumed $h \neq 0$.) The system (12b) admits one solution $S_{\lambda\mu\nu}$ if
gk - . Sie 
and only if ~ 0 and whenever k = 0, if and only if — 2} + 0° 
hh 1 \h 
As far as the system (12a) is concerned it is not difficult to find necessary
and sufficient conditions in terms of the ${}^*X$'s that it admit one solution
only.⁷ We assume that (12a) admits only one solution.

5. Relationship Between $\Gamma^{\nu}_{\lambda\mu}$ and ${}^*\Gamma^{\nu}_{\lambda\mu}$.—Put


A 
B 


» 
Wyev ~ 
») 


wey 
Then we have easily from (12a, b) 


A 
B 


v 


‘ 8 . ‘ [8 ’ 
Rie? Avisy Re” + Wp bur” Arg, 


ul y [w 


2K ou + 2k? Buia, &” + 2k? k,\! Big, 


wy 


wav ~ 
Substituting from the second of (14) into the first one we obtain 


7int .. OK 
Ap» 2 . 


in’ Cope 
where 


J 4 Zins 


wey 


and ‘aa, = FA vi I is a well-defined sum of products of four components 


$k_{\lambda\mu}$. If the system (12a) admits only one solution ${}^*S_{\lambda\mu\nu}$, and moreover
if the system (12b) admits only one solution $S_{\lambda\mu\nu}$, then (15) admits only one
solution so that


* Sap y = Sapy , T cB K wu" ( I 6) 


= rene awflople - : . 
Here the tensor 7°65 = 7\04)5, 1s uniquely defined by 


Zin Toe 


“wu apy 
Hence if we consider (6) and (8), i.e.,


*y ¥ “= fret 
Pye = inst FT 





510 MATHEMATICS: INONU AND WIGNER Proc. N. A. S. 


and substitute into it from (16) we obtain by virtue of (10) and (12b) 


"+ Qh Kg he? THY, — i(Sy,” — Bh Keg TEE) 
v av I” ‘ BT ee 
+h” Ken (2k? TH + i TH). 


xy ¥ 4 . 
Siw * 5y, ~~ thy 


Phy (1 + 1) Sry 

This formula establishes the relationship between the connections ${}^*\Gamma^{\nu}_{\lambda\mu}$ and
$\Gamma^{\nu}_{\lambda\mu}$.
* Prepared under Army Contract DA 33-008 OR D-467.

1 Einstein, Albert, "A Generalization of the Relativistic Theory of Gravitation,"
Ann. Math. (2), 46, 578-584 (1945) and "A Generalized Theory of Gravitation," Rev.
Modern Physics, 20, 35-39 (1948). Einstein, A., and Straus, E. G., "A Generalization
of the Relativistic Theory of Gravitation, II," Ann. Math. (2), 47, 731-741 (1946).

2 Einstein, A., The Meaning of Relativity, Princeton University Press, 1950, Appendix
II.

3 The existence of $\{{}^{\nu}_{\lambda\mu}\}$ is possible if and only if the determinant h of $h_{\lambda\mu}$ is different
from zero.

4 The indices are raised and lowered by means of $h^{\lambda\nu}$ and $h_{\lambda\mu}$.

5 Hlavatý, V., "The Elementary Basic Principles of the Unified Theory of Relativity.
B," J. Rat. Mech. Anal., 2, 1-52 (1953).

6 Hlavatý, V., and Sáenz, A. W., "Uniqueness Theorems in the Unified Theory of
Relativity," to be published in the J. Rat. Mech. Anal.

7 These conditions may be formulated in a similar way as it has been done for the
system (12b) in the paper mentioned in Ref. 5.


ON THE CONTRACTION OF GROUPS AND THEIR
REPRESENTATIONS
By E. INONU AND E. P. WIGNER
PALMER PHYSICAL LABORATORY, PRINCETON UNIVERSITY
Communicated April 21, 1953


Introduction.—Classical mechanics is a limiting case of relativistic
mechanics. Hence the group of the former, the Galilei group, must be 
in some sense a limiting case of the relativistic mechanics’ group, the 
representations of the former must be limiting cases of the latter's repre- 
sentations. There are other examples for similar relations between 
groups. Thus, the inhomogeneous Lorentz group must be, in the same 
sense, a limiting case of the de Sitter groups. The purpose of the present 
note is to investigate, in some generality, in which sense groups can be 
limiting cases of other groups (Section 1), and how their representations 
can be obtained from the representations of the groups of which they 
appear as limits (Section II). Section III deals briefly with the transition 
from inhomogeneous Lorentz group to Galilei group. It shows in which 
way the representation up to a factor of the Galilei group, embodied in 





VoL. 39, 19538 VATHEMATICS: INONU AND WIGNER 511 


the Schrédinger equation, appears as a limit of a representation of the 
inhomogeneous Lorentz group and also gives the reason why no physical 
interpretation is possible for the real representations of that group. 


I. CONTRACTION OF GROUPS 


Let us consider an arbitrary Lie group with $n$ parameters $a^i$ and
infinitesimal operators $I_i$. These shall be given, as usual, by

$$I_i = \lim_{h \to 0} \frac{g(h e_i) - g(0)}{h} \tag{1}$$

where $0$ are the parameters of the unit element and $e_i$ differs from $0$ by a
unit increase of $a^i$. The $I_i$ are skew hermitean matrices if the group
consists of unitary matrices; instead of them one often uses the hermitean
quantities $iI_i$. However, all our equations remain somewhat simpler if
expressed in terms of the $I_i$. The structure constants $C$ are defined by

$$[I_i, I_j] = \sum_k C_{ij}^{k} I_k \tag{2}$$


If we subject the $I_i$ to a linear homogeneous non-singular transformation,
the $C$ will be replaced by other constants. These are obtained from the
$C$ by contragredient transformations of its upper and lower indices.
However, such a transformation has, naturally, no effect on the structure of
the group. Let us denote the transformation in question by

$$J_\nu = \sum_i U_\nu^{i} I_i \tag{3}$$

It corresponds to the transformation

$$a^i = \sum_\nu U_\nu^{i} b^\nu \tag{3a}$$

according to which the $J$ are obtained by the same equation (1) as the
$I$, except that the $e_i$ have to be replaced in it by a similar quantity, defined
with respect to the $b$.

The above transformation may lead to a new group only if the matrix
$U$ of (3) is singular. We shall call the operation of obtaining a new group
by a singular transformation of the infinitesimal elements of the old group
a contraction of the latter. The reason for this term will become clear
below. The singular matrix will be a limiting case of a non-singular matrix.
The latter will depend linearly on a parameter $\epsilon$ which will tend to zero:

$$U_\nu^{i} = u_\nu^{i} + \epsilon\, w_\nu^{i} \tag{4}$$


For $0 < \epsilon < \epsilon_0$ the determinant of (4) is different from zero; it
vanishes for $\epsilon = 0$.
We shall transform (4) into a normal form by a non-singular and
$\epsilon$ independent transformation of both the $J_\nu$ and $I_i$. If the matrices
of the corresponding transformations are denoted by $\alpha$ and $\beta$, respectively,
$u$ and $w$ will be replaced by $\beta u \alpha^{-1}$ and $\beta w \alpha^{-1}$. It is possible,
by such a transformation, to give $u$ and $w$ the form

$$u = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \qquad w = \begin{pmatrix} v & 0 \\ 0 & 1 \end{pmatrix} \tag{5}$$


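The number of rows and columns in the unit matrix in $u$, and in $v$, is equal
to the rank $r$ of $u$. It is advantageous to label the transformed $J$ and $I$
with a pair of indices, the first referring to the subdivision of $u$ given in
(5), the second specifying the various $J$ and $I$ within that subdivision.
Hence, (3) assumes the form

$$J_{1\nu} = I_{1\nu} + \epsilon \sum_{\mu=1}^{r} v_{\nu\mu} I_{1\mu} \quad (\nu = 1, 2, \ldots, r), \qquad J_{2\nu} = \epsilon I_{2\nu} \quad (\nu = 1, 2, \ldots, n - r) \tag{6}$$

The corresponding transformation of the group parameters is

$$a_{1\nu} = b_{1\nu} + \epsilon \sum_{\mu=1}^{r} v_{\mu\nu} b_{1\mu} \quad (\nu = 1, 2, \ldots, r), \qquad a_{2\nu} = \epsilon b_{2\nu} \quad (\nu = 1, 2, \ldots, n - r) \tag{6a}$$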


It is well to remember that the parameters $a$ lead to the infinitesimal
elements $I$, the parameters $b$ to the $J$. The last equation shows that a
given set of parameters $b$ corresponds, with decreasing $\epsilon$, to smaller and
smaller values of the parameters $a_{2\nu}$. In the limit $\epsilon = 0$ (if such a limit
exists), one will have contracted the whole group to an infinitesimally
small neighborhood of the group defined by the $a_{1\nu}$ alone. This justifies
the name given to the process considered.

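The transformation of the infinitesimal elements which we carried out
also changes the structure constants and we shall write for (2)

$$[I_{\alpha\nu}, I_{\beta\mu}] = \sum_{\kappa=1}^{r} C_{\alpha\nu,\beta\mu}^{1\kappa} I_{1\kappa} + \sum_{\kappa=1}^{n-r} C_{\alpha\nu,\beta\mu}^{2\kappa} I_{2\kappa} \tag{7}$$

wherein $\alpha$ and $\beta$ can assume the values 1 and 2. This gives for the
commutator of two of the $J_{1\nu}$

$$[J_{1\nu}, J_{1\mu}] = \frac{1}{\epsilon} \sum_{\kappa=1}^{n-r} C_{1\nu,1\mu}^{2\kappa} J_{2\kappa} + O(1). \tag{8}$$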


Hence, if the commutator of $J_{1\nu}$ and $J_{1\mu}$ is to converge, as $\epsilon \to 0$, to a
linear combination of the $J$, the structure constants must satisfy

$$C_{1\nu,1\mu}^{2\kappa} = 0 \tag{9a}$$

i.e., the $I_{1\nu}$ must span a subgroup. On the other hand, if this happens
to be the case, the structure constants will converge to definite values
as $\epsilon \to 0$:

$$c_{1\nu,1\mu}^{1\kappa} = C_{1\nu,1\mu}^{1\kappa}, \quad c_{1\nu,1\mu}^{2\kappa} = 0; \qquad c_{1\nu,2\mu}^{1\kappa} = 0, \quad c_{1\nu,2\mu}^{2\kappa} = C_{1\nu,2\mu}^{2\kappa}; \qquad c_{2\nu,2\mu}^{1\kappa} = c_{2\nu,2\mu}^{2\kappa} = 0. \tag{9}$$

These structure constants satisfy Jacobi's identities since the structure
constants for the $J$ do this for non-vanishing $\epsilon$. We shall say that the
above operation is a contraction of the group with respect to the
infinitesimal elements $I_{1\nu}$, or that the infinitesimal elements $I_{2\nu}$ are
contracted. We then have, from (9):

THEOREM 1. Every Lie group can be contracted with respect to any of its
continuous subgroups and only with respect to these. The subgroup with
respect to which the contraction is undertaken will be called $S$. The
contracted infinitesimal elements form an abelian invariant subgroup of the
contracted group. The subgroup $S$ with respect to which the contraction was
undertaken is isomorphic with the factor group of this invariant subgroup.
Conversely, the existence of an abelian invariant subgroup and the possibility to
choose from each of its cosets an element so that these form a subgroup $S$, is a
necessary condition for the possibility to obtain the group from another group
by contraction.

It is easy to visualize now the effect of the contraction on the whole
group. The subgroup $S$ with respect to which the contraction is
undertaken remains unchanged and it is advantageous to choose the group
parameters in such a way that $a_{2\nu} = 0$ throughout $S$. Then (6a) can be
replaced by

$$a_{1\nu} = b_{1\nu}, \qquad a_{2\nu} = \epsilon b_{2\nu} \tag{6b}$$

and this can be assumed to be valid throughout the whole group, not only
in the neighborhood of the unit element. As $\epsilon$ decreases, a fixed range of
the parameters $b_{2\nu}$ will describe an increasingly small surrounding of $S$. As
$\epsilon$ tends to 0, the range of the $b_{2\nu}$ will become infinite and describe only
those group elements which differ infinitesimally from the elements of $S$.
The elements which are in the neighborhood of the unit element of the
original group but have finite parameters $b_{2\nu}$ will commute and form the
aforementioned commutative invariant subgroup. Naturally, the
elements of this invariant subgroup will not commute, in general, with the
elements of the subgroup $S$: the change of the parameters $a_{2\nu} = \epsilon b_{2\nu}$ will
be, upon transformation by finite elements of $S$, of the same order of
magnitude as these parameters themselves. Naturally, the convergence
of the original group toward the contracted group is a typically
non-uniform convergence.

Every Lie group can be contracted with respect to any of its one
parametric subgroups. If the three dimensional rotation group is contracted
in this way, one obtains the Euclidean group for two dimensions.
Contraction of the homogeneous Lorentz group with respect to the subgroup
which leaves the time coordinate invariant yields the homogeneous Galilei
group; contraction of the inhomogeneous Lorentz group with respect to
the group generated by spatial rotations and time displacements yields the
full Galilei group. Contraction of the de Sitter groups yields the
inhomogeneous Lorentz group. It should be remarked, finally, that if a group,
obtained by contraction of another group with respect to the subgroup $S$,
is contracted again with respect to $S$, the second contraction remains
without effect.
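
As a hedged illustration of the rotation group example, the sketch below
rescales the structure constants of the three dimensional rotation group and
lets $\epsilon$ become small. It assumes the standard relations
$[I_i, I_j] = \sum_k \epsilon_{ijk} I_k$, keeps the rotation about the third axis
fixed, and multiplies the other two generators by $\epsilon$. The function names
are hypothetical and are not taken from the paper.

```python
def levi_civita(i, j, k):
    """Totally antisymmetric symbol on the indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2


def so3_structure_constants():
    """C[i][j][k] such that [I_i, I_j] = sum_k C[i][j][k] * I_k (assumed convention)."""
    return [[[levi_civita(i, j, k) for k in range(3)]
             for j in range(3)] for i in range(3)]


def rescaled_constants(C, scale):
    """Structure constants of J_a = scale[a] * I_a for a diagonal rescaling.

    [J_a, J_b] = scale[a] * scale[b] * sum_c C[a][b][c] * J_c / scale[c].
    """
    n = len(C)
    return [[[scale[a] * scale[b] * C[a][b][c] / scale[c]
              for c in range(n)] for b in range(n)] for a in range(n)]


def contract_so3(eps):
    """Keep the z rotation (index 2) and scale the x, y rotations by eps."""
    return rescaled_constants(so3_structure_constants(), [eps, eps, 1.0])


if __name__ == "__main__":
    c = contract_so3(1e-4)
    print("[J_x, J_y] coefficient of J_z:", c[0][1][2])
    print("[J_z, J_x] coefficient of J_y:", c[2][0][1])
    print("[J_z, J_y] coefficient of J_x:", c[2][1][0])
```

For small `eps` the commutator of the two rescaled generators is of order
`eps**2`, while their commutators with the unscaled rotation are unchanged;
these are the relations of the Euclidean group of the plane.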

The above considerations show a certain similarity with those of I. E. 
Segal.¹ However, Segal’s considerations are more general than ours as
he considers a sequence of Lie groups the structure constants of which 
converge toward the structure constants of a non-isomorphic group. In 
the above, we have considered only one Lie group but have introduced 
a sequence of coordinate systems therein and investigated the limiting 
case of these coordinate systems becoming singular. As a result of our 
problem being more restricted, we could arrive at more specific results. 


II. CONTRACTION OF REPRESENTATIONS 


If one applies the transformation (6) to the infinitesimal elements of
a representation of the group to be contracted, and lets $\epsilon$ tend to zero, the
$J_{2\nu}$ will also tend to zero. The representation will become isomorphic to
the representation of the subgroup $S$, i.e., will be a representation of the
factor group of the invariant subgroup. In order to obtain a faithful
representation, one must either subject the $J_{2\nu}$ to an $\epsilon$ dependent
transformation, or consider the $J_{2\nu}$ which correspond to different
representations, e.g., go to higher and higher dimensional representations as
$\epsilon$ decreases. We shall give examples for both procedures.

(a) Representations of the Contracted Group by Means of $\epsilon$ Dependent
Transformations. The first procedure is applicable only if the infinitesimal
elements are not bounded operators, i.e., as far as irreducible
representations are concerned, only if the group is not compact. The simplest
non-compact non-commutative group is that of the linear transformations
$x' = e^{\alpha}x + \beta$. The general group element is $O_{\alpha,\beta} = T(\beta)R_\alpha$, with the
group relations $T(\beta)T(\beta') = T(\beta + \beta')$; $R_\alpha R_{\alpha'} = R_{\alpha+\alpha'}$ and
$R_\alpha T(\beta) = T(e^{\alpha}\beta)R_\alpha$. The only faithful irreducible unitary representation
of this group can be given in the Hilbert space of square integrable
functions of $0 < x < \infty$:

$$R_\alpha \psi(x) = e^{\alpha/2} \psi(e^{\alpha}x), \qquad T(\beta)\psi(x) = e^{i\beta x}\psi(x). \tag{10}$$

The infinitesimal operators are

$$I_1 = \tfrac{1}{2} + x\,\frac{d}{dx}, \qquad I_2 = ix, \tag{10a}$$

with the structure relation

$$[I_1, I_2] = I_2. \tag{10b}$$


Contraction with respect to the group of transformations $x' = e^{\alpha}x$ leaves
the group unchanged: the only non-vanishing structure constant is $C_{12}^{2}$
and (9) shows that this does not change. We can try, therefore, to
transform $I_1$ and $\epsilon I_2$ with an $\epsilon$ dependent unitary matrix $S_\epsilon$ so that
$I_1$ remains unchanged and the transformed $\epsilon I_2$ converges to $J_2$:

$$S_\epsilon^{-1} I_1 S_\epsilon = I_1, \qquad \epsilon\, S_\epsilon^{-1} I_2 S_\epsilon \to J_2. \tag{11}$$


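This is indeed possible: one has to choose $S_\epsilon = R_{\ln\epsilon}$. This commutes
with $I_1$. It follows from the group relation that
$S_\epsilon^{-1} T(\beta) S_\epsilon = R_{-\ln\epsilon} T(\beta) R_{\ln\epsilon} = T(\beta/\epsilon)$ and one has, hence, as $h \to 0$

$$\epsilon S_\epsilon^{-1} I_2 S_\epsilon = \epsilon S_\epsilon^{-1} \lim h^{-1}(T(h) - 1) S_\epsilon = \lim \epsilon h^{-1}(T(h/\epsilon) - 1) = I_2. \tag{11a}$$

Hence $\epsilon S_\epsilon^{-1} I_2 S_\epsilon$ not only converges to $J_2 = I_2$ but remains equal to it
for all $\epsilon$.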
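
The conjugation used above can be checked on the point transformations
themselves. The following minimal sketch, assuming the group elements act as
$x \to e^{\alpha}x + \beta$ and compose as maps (the class `Affine` and the helper
names are hypothetical), verifies that conjugating a translation by $\beta$ with
the dilation $R_{\ln\epsilon}$ gives the translation by $\beta/\epsilon$.

```python
import math


class Affine:
    """The map x -> exp(alpha) * x + beta, i.e. T(beta) R_alpha."""

    def __init__(self, alpha, beta):
        self.alpha = alpha
        self.beta = beta

    def __call__(self, x):
        return math.exp(self.alpha) * x + self.beta

    def __mul__(self, other):
        """Group product as composition: (self * other)(x) = self(other(x))."""
        return Affine(self.alpha + other.alpha,
                      math.exp(self.alpha) * other.beta + self.beta)

    def inverse(self):
        """Inverse map x -> exp(-alpha) * (x - beta)."""
        return Affine(-self.alpha, -math.exp(-self.alpha) * self.beta)


def R(alpha):
    """Pure dilation x -> exp(alpha) * x."""
    return Affine(alpha, 0.0)


def T(beta):
    """Pure translation x -> x + beta."""
    return Affine(0.0, beta)


def conjugate_translation(beta, eps):
    """Return S^{-1} T(beta) S with S = R(ln eps)."""
    S = R(math.log(eps))
    return S.inverse() * T(beta) * S


if __name__ == "__main__":
    g = conjugate_translation(3.0, 0.25)
    print(g.alpha, g.beta)  # dilation part and translation part of the result
```

The same composition rule also reproduces the group relation
$R_\alpha T(\beta) = T(e^{\alpha}\beta)R_\alpha$ quoted in the text.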

It is more surprising, perhaps, to see that the same device is possible
also if one contracts the group with respect to the subgroup of the $T(\beta)$.
The contracted group is, in this case, the two parametric abelian group.
We demand, in this case, that $S_\epsilon$ commute with $I_2$ and hence, by (10a),
that it be multiplication with a function of $x$. Because of the unitary
nature of $S_\epsilon$, we can give it the form $\exp(if(x, \epsilon))$. Transformation of the
$\epsilon I_1$ of (10a) with this gives

$$\epsilon S_\epsilon^{-1} I_1 S_\epsilon = \epsilon\, e^{-if(x,\epsilon)} \left(\tfrac{1}{2} + x\,\frac{d}{dx}\right) e^{if(x,\epsilon)} = \epsilon\left(\tfrac{1}{2} + x\,\frac{d}{dx}\right) + i\epsilon x\,\frac{d}{dx} f(x, \epsilon). \tag{12}$$

The first part of this converges to 0 as it should since (12) should converge
to an operator which commutes with $ix$. The second part converges to
$J_1 = ixf'(x) = ig(x)$ if one sets

$$f(x, \epsilon) = \epsilon^{-1} f(x). \tag{12a}$$

Hence, the transformation of the contracted group corresponding to the
parameters $\alpha$, $\beta$ is multiplication with

$$e^{i\alpha g(x) + i\beta x} \tag{13}$$







which is again a faithful (reducible) representation of the contracted 
group. 

The operators corresponding to finite group elements could have been
obtained directly by transforming $O_{\alpha,\epsilon\beta} = T(\epsilon\beta)R_\alpha$ with $R_{\ln\epsilon}$ in the case
of contraction with respect to $R_\alpha$. Similarly, (13) could have been
obtained directly by transforming $O_{\epsilon\alpha,\beta} = T(\beta)R_{\epsilon\alpha}$ with $\exp(i\epsilon^{-1}f(x))$.

It is not clear how generally one can obtain a faithful representation
of the contracted group as a limit of an $\epsilon$ dependent transform of a faithful
representation of the original group and the substitution (6b) of its
parameters. Certainly, the procedure is not applicable to irreducible
representations of compact groups or, more generally, if the infinitesimal
operators are bounded.

(b) Representations of the Contracted Group from a Sequence of
Representations. We shall now give a few examples for the second procedure,
i.e., obtaining a representation of the contracted group by choosing a
sequence of unitary representations $D^{(1)}, D^{(2)}, \ldots, D^{(l)}, \ldots$ so that each
of the operators

$$I_{1\nu}^{(l)} \quad (\nu = 1, 2, \ldots, r), \qquad \epsilon I_{2\nu}^{(l)} \quad (\nu = 1, 2, \ldots, n - r) \tag{15}$$

converges to a finite operator as $\epsilon \to 0$ and $l \to \infty$. Alternately, we can
ask that the transformation corresponding to finite group elements

$$D^{(l)}(b_{1\nu}, \epsilon b_{2\nu}) \tag{15a}$$

converge to a unitary representation of the contracted group as $\epsilon \to 0$,
$l \to \infty$. The $b$ are the parameters of the contracted group; the operator
(15a) corresponds, in the representation $D^{(l)}$, to the group element of the
original group the parameters $a$ of which are given by (6b). The
formulation making use of the finite group elements (15a) is unambiguous because
it deals with the convergence of unitary operators; the first one is usually
easier to attack directly.

The convergence of the sequences of (15) and (15a) will depend not
only on the values which $\epsilon$ assumes and on the corresponding
representations $D^{(l)}$, it will also depend on the form in which that representation
is assumed. Hence, method (a) can be considered as a special case of the
present method in which all $D^{(l)}$ are unitary equivalent.

The contracted group is always an open group because the variability
domain of the $b_{2\nu}$ is infinite. Hence its representations are, as a rule,
infinite dimensional. If the $D^{(l)}$ are finite dimensional, they should be
considered to affect only a finite number of the coordinates of Hilbert
space.

The simplest non-commutative compact group is the three-dimensional
rotation group. All its subgroups are one parametric; we shall contract
it with respect to the rotations about the $z$ axis of a rectangular coordinate
system of ordinary space. The contracted group is the Euclidean group
of the plane, i.e., the inhomogeneous two-dimensional rotation group.
We shall choose for $D^{(l)}$ the representation which is usually denoted² by
$D^{(l)}$: it is $2l + 1$ dimensional ($l$ being any integer) and is usually described
in a space the coordinate axes of which are labeled with
$m = -l, -l + 1, \ldots, l - 1, l$. Hence, we label the coordinate axes of Hilbert
space with all integers $m$ from $-\infty$ to $\infty$. In keeping with our previous
notation, we call the infinitesimal element which corresponds to rotations
about the $z$ axis $M_z = I_1$. One can then write for $|m| \le l$, $|m'| \le l$

$$(I_1^{(l)})_{m'm} = (M_z^{(l)})_{m'm} = im\,\delta_{m'm} \tag{16}$$

$$(I_{2x}^{(l)})_{m'm} = (M_x^{(l)})_{m'm} = \tfrac{1}{2}\sqrt{(l - m)(l + m')}\;\delta_{m',m+1} - \tfrac{1}{2}\sqrt{(l - m')(l + m)}\;\delta_{m',m-1} \tag{16a}$$

$$(I_{2y}^{(l)})_{m'm} = (M_y^{(l)})_{m'm} = \tfrac{i}{2}\sqrt{(l - m)(l + m')}\;\delta_{m',m+1} + \tfrac{i}{2}\sqrt{(l - m')(l + m)}\;\delta_{m',m-1} \tag{16b}$$


All matrix elements vanish if either $|m|$ or $|m'|$ is larger than $l$. As
$l \to \infty$, the $I_1^{(l)}$ converge to a definite operator

$$(J_1)_{m'm} = im\,\delta_{m'm} \qquad (-\infty < m, m' < \infty). \tag{17a}$$

In fact, the convergence is strong in the sense that $I_1^{(l)}\varphi$ converges strongly
to $J_1\varphi$ if $\varphi$ is in the definition domain of $J_1$. If $\epsilon$ and $l$ converge to zero
and infinity, respectively, in such a way that $l\epsilon \to \Xi$, the other two
infinitesimal elements will also converge to $J_{2x}$, $J_{2y}$, respectively, where

$$(J_{2x})_{m'm} = -\tfrac{1}{2}\,\Xi\,(\delta_{m',m-1} - \delta_{m',m+1}), \qquad (J_{2y})_{m'm} = \tfrac{i}{2}\,\Xi\,(\delta_{m',m-1} + \delta_{m',m+1}). \tag{17b}$$


It follows from the fact that there are only a finite number (two) of
non-vanishing matrix elements in each row of both $J_{2x}$ and $J_{2y}$ that the
operators satisfy the commutation relations of the contracted group

$$[J_{2x}, J_{2y}] = 0, \qquad [J_1, J_{2x}] = J_{2y}, \qquad [J_1, J_{2y}] = -J_{2x}. \tag{18}$$

It further follows from

$$(I_1^{(l)})^2 + (I_{2x}^{(l)})^2 + (I_{2y}^{(l)})^2 = -l(l + 1)$$

by multiplication with $\epsilon^2$ and going to the limit in the above way that

$$(J_{2x})^2 + (J_{2y})^2 = -\Xi^2. \tag{18a}$$
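
The limit $l\epsilon \to \Xi$ can be made concrete numerically. The sketch below
assumes that the non-vanishing elements of the $x$ and $y$ rotations have the
modulus $\tfrac{1}{2}\sqrt{(l - m)(l + m + 1)}$ familiar from the angular momentum
matrices, sets $\epsilon = \Xi/l$, and shows that the scaled elements approach
$\Xi/2$ while $\epsilon^2 l(l + 1)$ approaches $\Xi^2$. The function names are
hypothetical.

```python
import math


def scaled_element(l, m, xi):
    """Modulus of the (m + 1, m) element of eps * M_x for eps = xi / l."""
    eps = xi / l
    return 0.5 * eps * math.sqrt((l - m) * (l + m + 1))


def scaled_casimir(l, xi):
    """eps**2 * l * (l + 1), the scaled magnitude of the rotation invariant."""
    eps = xi / l
    return eps * eps * l * (l + 1)


if __name__ == "__main__":
    xi = 2.0
    for l in (10, 1000, 10 ** 6):
        print(l, scaled_element(l, 3, xi), scaled_casimir(l, xi))
```

For fixed $m$ the dependence on $m$ disappears in the limit, which is why the
limiting operators have constant elements next to the diagonal.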
























It was important, for obtaining convergent sequences of $I_1^{(l)}$ and $\epsilon I_2^{(l)}$,
to have assumed the $D^{(l)}$ in the form given by (16). This form was
reduced out with respect to the subgroup $S$ and this caused, in this case,
the convergence of the sequence $I_1^{(l)}$. The convergence of the $\epsilon I_2^{(l)}$ does
not actually follow from the convergence of the $I_1^{(l)}$, and hence from the
reduced out form of the representations $D^{(l)}$, but was made at least possible
by this circumstance.
Before going over to the investigation of the Lorentz groups, it may
be worth while to make a final remark about the above contraction, even
though it has little to do with our subject. We shall determine, first,
the matrices which correspond to finite group elements. For this purpose,
it is useful to consider that form of the above representation in which the
Hilbert space consists of functions of $x$ and $y$ and the infinitesimal operators
have the natural form for infinitesimal operators of the Euclidean group
($\alpha$ and $r$ are polar coordinates, $x = r\cos\alpha$, $y = r\sin\alpha$)

$$J_1 = -\partial/\partial\alpha = y\,\partial/\partial x - x\,\partial/\partial y \tag{19}$$

$$J_{2x} = \partial/\partial y, \qquad J_{2y} = -\partial/\partial x. \tag{19a}$$






One should keep in mind that $J_{2x}$ arose from $I_{2x}$, which is the infinitesimal
rotation about the $x$ axis and corresponds to a displacement in the $-y$
direction. Similarly $J_{2y}$ corresponds to a displacement in the $x$ direction.
The function $\varphi(\alpha, r)$ corresponds in the new Hilbert space to the vector
which has the components $\varphi_m$ in the Hilbert space of (17a), (17b). It
further follows from (18a) that

$$\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\right)\varphi = -\Xi^2\varphi \tag{20}$$






whence one can write

$$\varphi(x, y) = \int e^{-i\Xi(x\cos\alpha' + y\sin\alpha')}\, g(\alpha')\, d\alpha' = \int e^{-i\Xi r\cos(\alpha - \alpha')}\, g(\alpha')\, d\alpha' = \int e^{-i\Xi r\cos\alpha'}\, g(\alpha - \alpha')\, d\alpha'. \tag{20a}$$

All integrations are from 0 to $2\pi$. Expanding $g(\alpha - \alpha')$ into a Fourier
series of $\alpha - \alpha'$, one finds

$$\varphi(x, y) = \int e^{-i\Xi r\cos\alpha'} \sum_m g_m e^{-im(\alpha - \alpha')}\, d\alpha'; \tag{20b}$$

$m$ runs from $-\infty$ to $\infty$. The last form makes it easy to calculate $J_1$ by
(19), the first form permits one to calculate $J_{2x}$ and $J_{2y}$ easily. Comparison
of the expressions obtained in this way with (17a) and (17b) shows that
$g_m = \varphi_m$. In order to remain in keeping with the usual notation, we
define³

$$2\pi J_m(z) = i^m \int e^{-iz\cos\alpha'} e^{im\alpha'}\, d\alpha'.$$


$J_m$ is then the ordinary Bessel function of order $m$. This permits one to
write for (20b)

$$\varphi(\alpha, r) = 2\pi \sum_m \varphi_m\, i^{-m} e^{-im\alpha} J_m(\Xi r). \tag{21}$$


Because of (19a), one can write down at once the finite group operations.
In particular, for the displacement $T(\xi, \eta)$ by $\xi$ and $\eta$ in the $x$ and $y$
directions one has

$$T(\xi, \eta)\varphi(x, y) = \varphi(x - \xi, y - \eta). \tag{22}$$

Hence, denoting the matrix for the same operation in the original Hilbert
space by $T(\xi, \eta)_{mm'}$, one has

$$\varphi(x - \xi, y - \eta) = 2\pi \sum_m \sum_{m'} T(\xi, \eta)_{mm'}\,\varphi_{m'}\, i^{-m} e^{-im\alpha} J_m(\Xi r). \tag{22a}$$

This permits an explicit determination of the $T(\xi, \eta)_{mm'}$. We shall not
carry this out completely but set only $r = 0$ in (22a). Since all $J_m(0) = 0$
except $J_0(0) = 1$, the summation over $m$ disappears on the right side.
The left side becomes, at the same time by (21)

$$\varphi(-\xi, -\eta) = 2\pi \sum_m \varphi_m\, i^{-m} e^{-im(\beta + \pi)} J_m(\Xi\rho) \tag{22b}$$

where $\beta$, $\rho$ are the polar coordinates for $\xi$, $\eta$. Comparing (22a) and (22b)
one finds, with $\xi = 0$, $\beta = \tfrac{1}{2}\pi$, $\eta = \rho$

$$J_m(\Xi\rho) = T(0, \rho)_{0m}. \tag{23}$$


The group relations and the form of the infinitesimal operators (19a) give
at once the most important relations for Bessel functions, such as the
addition theorem, differential equation (cf. (20)), etc. Up to this point the
argument is not new but merely a repetition, for the two dimensional
Euclidean group, of a similar reasoning given before⁴ for the rotation
group. This led to the equation⁴

$$D^{(l)}(\{0, \beta, 0\})_{0m} = \mathcal{P}_l^{m}(\cos\beta) = \left(\frac{(l - m)!}{(l + m)!}\right)^{1/2} P_l^{m}(\cos\beta) \tag{24}$$

in which $P_l^{m}$ is Legendre's associated function of the first kind and
$\mathcal{P}_l^{m}$ is normalized to the same value as $P_l^{0} = P_l$. Furthermore,
$\{0, \beta, 0\}$ is the rotation about the $x$ axis so that, by the definition of $T(0, \rho)$

$$T(0, \rho)_{0m} = \lim_{l \to \infty} D^{(l)}(\{0, \Xi\rho/l, 0\})_{0m}.$$

This, together with (23), gives the asymptotic expression for the associated
Legendre functions⁵

$$\lim_{l \to \infty} \mathcal{P}_l^{m}(\cos(\rho/l)) = J_m(\rho). \tag{25}$$
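
For $m = 0$ the asymptotic relation between Legendre and Bessel functions can
be checked numerically. The following is a minimal sketch, restricted to
$m = 0$ so that no normalization convention for the associated functions
enters; it evaluates $P_l(\cos(\rho/l))$ by the standard three-term recurrence
and $J_0(\rho)$ by its power series. The function names are hypothetical.

```python
import math


def legendre(l, x):
    """P_l(x) from the recurrence (k + 1) P_{k+1} = (2k + 1) x P_k - k P_{k-1}."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def bessel_j0(z, terms=40):
    """J_0(z) from the power series sum_k (-1)^k (z/2)^(2k) / (k!)^2."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= -(z / 2.0) ** 2 / ((k + 1) ** 2)
    return total


if __name__ == "__main__":
    rho = 1.5
    for l in (10, 100, 1000, 4000):
        print(l, legendre(l, math.cos(rho / l)), bessel_j0(rho))
```

The difference between the two columns decreases roughly like $1/l$, in line
with the limit stated in the text.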







III. CONTRACTION OF LORENTZ GROUPS 


Let us consider, first, the inhomogeneous Lorentz group with one
space-like, one time-like dimension. It is given by the transformations

$$x' = x\cosh\lambda + t\sinh\lambda + a_x, \qquad t' = x\sinh\lambda + t\cosh\lambda + a_t. \tag{26}$$

We wish to contract it with respect to the subgroup of time displacements
$t' = t + a_t$. The infinitesimal elements of (26) are: time displacement
$I_t$, space displacement $I_x$, and “rotation” in space-time $I_{xt}$. Their
commutation relations read

$$[I_t, I_x] = 0, \qquad [I_t, I_{xt}] = -I_x, \qquad [I_x, I_{xt}] = -I_t. \tag{27}$$

Hence, by (9), the commutation relations of the contracted group are

$$[J_t, J_x] = 0, \qquad [J_t, J_{xt}] = -J_x, \qquad [J_x, J_{xt}] = 0. \tag{27a}$$


The “rotations” in space-time, together with the displacements in space,
form a commutative invariant subgroup.
The matrices

$$\begin{pmatrix} \cosh\lambda & \sinh\lambda & a_x \\ \sinh\lambda & \cosh\lambda & a_t \\ 0 & 0 & 1 \end{pmatrix} \tag{26a}$$

form a natural, though not unitary, representation of the group of
transformations (26). We can carry out the contraction by setting $a_t = b_t$,
$\lambda = \epsilon v$, $a_x = \epsilon b_x$, or $\lambda = v/c$, $a_x = b_x/c$, and letting $\epsilon$ converge to 0,
or $c$ converge to infinity. If we do this directly in (26a), the representation
will not remain faithful for the contracted group. We shall transform
therefore (26a) with a suitable $\epsilon$ (or $c$) dependent matrix: multiply the
first row with $c$, the first column with $1/c$. If $c$ goes to infinity in the
matrix obtained in this way, one obtains the transformations of the
contracted group

$$x' = x + vt + b_x, \qquad t' = t + b_t. \tag{26b}$$

It is the inhomogeneous Galilei group with one spatial dimension. The
transformations $x' = x + vt + b_x$, $t' = t$ form the commutative invariant
subgroup.
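
The limit described above is easy to reproduce numerically. The sketch below
builds the 3 x 3 matrix of a Lorentz transformation with $\lambda = v/c$ and
$a_x = b_x/c$, multiplies the first row by $c$ and the first column by $1/c$,
and compares the result with the matrix of the Galilei transformation
$x' = x + vt + b_x$, $t' = t + b_t$. The function names are hypothetical.

```python
import math


def lorentz_matrix(lam, a_x, a_t):
    """Matrix acting on (x, t, 1) for the transformation with rapidity lam."""
    return [[math.cosh(lam), math.sinh(lam), a_x],
            [math.sinh(lam), math.cosh(lam), a_t],
            [0.0, 0.0, 1.0]]


def rescaled_lorentz(v, b_x, b_t, c):
    """Lorentz matrix with lam = v/c, a_x = b_x/c, conjugated by diag(c, 1, 1)."""
    m = lorentz_matrix(v / c, b_x / c, b_t)
    scale = [c, 1.0, 1.0]
    # Row i is multiplied by scale[i], column j is divided by scale[j].
    return [[scale[i] * m[i][j] / scale[j] for j in range(3)]
            for i in range(3)]


def galilei_matrix(v, b_x, b_t):
    """Matrix acting on (x, t, 1) for x' = x + v t + b_x, t' = t + b_t."""
    return [[1.0, v, b_x],
            [0.0, 1.0, b_t],
            [0.0, 0.0, 1.0]]


if __name__ == "__main__":
    for c in (10.0, 1.0e2, 1.0e4):
        approx = rescaled_lorentz(3.0, 2.0, 5.0, c)
        exact = galilei_matrix(3.0, 2.0, 5.0)
        err = max(abs(approx[i][j] - exact[i][j])
                  for i in range(3) for j in range(3))
        print(c, err)
```

The deviation from the Galilei matrix is of order $1/c^2$; without the
rescaling of row and column the velocity $v$ would drop out of the limit.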
The same contraction can be carried out for an inhomogeneous Lorentz
group with an arbitrary number of spatial dimensions. The only difference
is that the subgroup $S$, with respect to which the contraction is carried
out, contains not only the displacements in time as in the above example,
but also all purely spatial rotations, i.e., all homogeneous transformations
which leave $t$ invariant. The invariant subgroup of the contracted group
consists of all spatial displacements and Galilei transformations:
$x_i' = x_i + v_i t + b_i$, $t' = t$.

Contraction of the Unitary Representations of the Lorentz Groups. We
shall be principally concerned here with the group of the special theory
of relativity, i.e., the inhomogeneous Lorentz group with three space-like
and one time-like dimension. The subgroup $S$ with respect to which we
shall contract it contains the displacements in time, the spatial rotations,
and the products of these operations. The contracted group is the ordinary
Galilei group, i.e., the group of classical mechanics.

We shall denote the displacement operators in the direction of the three
space-like axes by $I_k$ ($k = 1, 2, 3$), the displacement in the direction of the
time axis by $I_0$. The rotations in the $kl$ plane will be denoted by $I_{kl}$, the
acceleration in the direction of the $k$ axis by $I_{k0}$. $I_0$ and $I_{kl}$ span the
subgroup $S$.

The quantity

$$I_1^2 + I_2^2 + I_3^2 - I_0^2 = P \tag{28}$$


is a constant in every irreducible representation and the irreducible
representations can be divided into three classes according to the value of this
constant.⁶ In the first class, $P < 0$ and the momenta (which are $-i$ times
the infinitesimal operators) are space-like. In the second class $P = 0$ and
the momenta form a null vector; $P > 0$ in the third class.

It is generally admitted that the representations of the first class have
no physical significance because the momenta of all observed particles are
time-like or null vectors. Hence we shall investigate only the simplest
one of these representations. Its operators are most easily given in the
Hilbert space of functions of three variables $p_1$, $p_2$, $p_3$ which are restricted
to the outside of a sphere of radius $\sqrt{-P}$. The expressions for the
infinitesimal operators are

$$I_k = ip_k, \qquad I_0 = -i(p_1^2 + p_2^2 + p_3^2 + P)^{1/2}. \tag{28a}$$

This last equation also shows the reason for the variables $p_k$ to be restricted
by $p_1^2 + p_2^2 + p_3^2 > -P$: if this inequality is not fulfilled, $I_0$ ceases to be
skew hermitean and, hence, the representation is not unitary. Further

$$I_{kl} = p_l\,\partial/\partial p_k - p_k\,\partial/\partial p_l \tag{28b}$$

$$I_{k0} = -(p_1^2 + p_2^2 + p_3^2 + P)^{1/2}\,\partial/\partial p_k. \tag{28c}$$


It is useful to introduce new variables instead of the $p_k$ both in order to
simplify the definition domain of the variables and also to bring the
operations of the subgroup $S$ into a form which is independent of $P$. This can
be done, most simply, by introducing as new variables

$$p_0 = (p_1^2 + p_2^2 + p_3^2 + P)^{1/2}, \qquad \Omega_k = p_k/(p_0^2 - P)^{1/2}. \tag{29}$$

The $\Omega$ are restricted to a unit sphere, $p_0$ changes from 0 to $\infty$. In these
variables, the scalar product between $\varphi$ and $\psi$ reduces to

$$(\varphi, \psi) = \int d\Omega \int_0^{\infty} dp_0\,(p_0^2 - P)^{1/2}\,\varphi^{*}\psi, \tag{29a}$$

$d\Omega$ being the surface element of the unit sphere over which the integration
with respect to the $\Omega$ is to be extended. In terms of the new variables, the
infinitesimal operators assume the form

$$I_{kl} = \Omega_l\,\partial/\partial\Omega_k - \Omega_k\,\partial/\partial\Omega_l, \qquad I_0 = -ip_0 \tag{30}$$

$$I_k = i(p_0^2 - P)^{1/2}\,\Omega_k, \qquad I_{k0} = -(p_0^2 - P)^{1/2}\,\Omega_k\,\partial/\partial p_0 - \frac{p_0}{(p_0^2 - P)^{1/2}} \sum_{l=1}^{3} \Omega_l\,(\Omega_l\,\partial/\partial\Omega_k - \Omega_k\,\partial/\partial\Omega_l). \tag{30a}$$

If we now set $J_k = \epsilon I_k$, the $J_k$ will converge to zero unless $-P$ becomes
inversely proportional to $\epsilon^2$, i.e., unless $-\epsilon^2 P$ converges to a definite limit
$\mathcal{P}^2$. If this is assumed, the second term of $\epsilon I_{k0}$ will converge to zero and
the infinitesimal elements of the representation of the contracted group
become

$$J_{kl} = \Omega_l\,\partial/\partial\Omega_k - \Omega_k\,\partial/\partial\Omega_l, \qquad J_0 = -ip_0 \tag{31}$$

$$J_{k0} = -\mathcal{P}\,\Omega_k\,\partial/\partial p_0, \qquad J_k = i\mathcal{P}\,\Omega_k. \tag{31a}$$

These operators indeed span a unitary representation of the Galilei group:
they correspond to case II with $m = 0$ of a recent determination of these
representations.⁷ The $p_k$ of this article correspond to our $\mathcal{P}\Omega_k$, the variable
$s$ is given by $i\mathcal{P}\,\partial/\partial p_0$. One also understands now why it was impossible
to find a physical interpretation to this representation: it is the limiting
case of a representation of the relativistic group with imaginary mass $P^{1/2}$.
The same is probably true of the other true representations⁷ of the Galilei
group.

We go over now to the investigation of the simplest representation with
positive $P$, i.e., the representation of the Klein-Gordon equation. The
infinitesimal operators are again given by (28a), (28b), (28c). However,
since $P$ is positive, the variability domain of the $p$ extends over the whole
three dimensional space. The scalar product of two functions $\varphi$ and $\psi$ is
now given by

$$(\varphi, \psi) = \int\!\!\int\!\!\int dp_1\,dp_2\,dp_3\;p_0^{-1}\,\varphi^{*}\psi \tag{32}$$

in which $p_0$ is still given by (29).

If we set, in the sense of (6),

$$J_k = \epsilon I_k = i\epsilon p_k \tag{33}$$

the operation of time displacement becomes

$$iJ_0 = iI_0 = (-I_1^2 - I_2^2 - I_3^2 + P)^{1/2} = \epsilon^{-1}(-J_1^2 - J_2^2 - J_3^2 + \epsilon^2 P)^{1/2}. \tag{33a}$$

Since the $J_k$ are skew hermitean, the $-J_k^2$ are positive definite
hermitean operators. Since $\epsilon^2 P$ is also positive, the expectation value
$(\varphi, iJ_0\varphi)$ of $iJ_0$ is, for any state $\varphi$, greater than $\epsilon^{-1}$ times the
expectation value of $iJ_1$. It follows that if $J_1\varphi$ converges to a vector in
Hilbert space as $\epsilon \to 0$, the vector $J_0\varphi$ must grow beyond all limits. The
same is true, of course, for the other $J_k$. It follows that the representations
considered cannot be contracted in the sense discussed in the previous
sections and the same is true of all representations of the classes $P > 0$.

It is possible, however, to contract these representations to
representations up to a factor of the Galilei group. The commutation relations of
the infinitesimal elements of representations up to a factor differ from the
commutation relations of real representations by the appearance of a
constant in the structural relations. Hence

$$[J_i, J_k] = \sum_l c_{ik}^{l} J_l + b_{ik} 1 \tag{34}$$

where $c_{ik}^{l}$ are the structure constants of the group to be represented (in
our case, the inhomogeneous Galilei group) and the $b_{ik}1$ are multiples of
the unit operator. One will, therefore, obtain infinitesimal elements of
representations up to a factor if one sets, instead of (6)

$$J_{1\nu} = I_{1\nu} - a_{1\nu}1, \qquad J_{2\nu} = \epsilon I_{2\nu} - a_{2\nu}1 \tag{34a}$$

in which all $a$ may depend on $\epsilon$. Since the additional terms in (34a)
commute with all other operators, these additional terms will not affect
the left side of (34). Hence, they must be compensated also on the right
side and this is done by the additional terms $b_{ik}1$. The point of
introducing the terms $a1$ in (34a), which then necessitates the introduction of the
$b$ in (34), is that the right sides of (34a) may converge to finite
non-vanishing operators even if the $I_{1\nu}$, $\epsilon I_{2\nu}$ cannot be made to converge.

The above generalization of the concept of contraction indeed allows
a contraction of the representations given by (28a), (28b), (28c) also for
$P > 0$. As we let $P$ go to infinity, $I_0$ will also tend to infinity
($(I_0\varphi, I_0\varphi)$ converges to infinity for all $\varphi$). However, subtracting
$-iP^{1/2}1$ from $I_0$, it will converge to

$$J_0 = \lim\left[-i(-I_1^2 - I_2^2 - I_3^2 + P)^{1/2} + iP^{1/2}\right] = (i/2P^{1/2})(I_1^2 + I_2^2 + I_3^2) = (i/2P^{1/2}\epsilon^2)(J_1^2 + J_2^2 + J_3^2). \tag{35}$$







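This shows that $J_0$ will converge to a finite operator if the $J_k = \epsilon I_k$ do and
if $P^{1/2}\epsilon^2$ converges to a finite constant $m$ as $P \to \infty$, $\epsilon \to 0$. Both can be
accomplished by assuming the representation in such a form that, instead
of (28a)

$$I_k = ip_k/\epsilon, \qquad I_0 = -i(P + (p_1^2 + p_2^2 + p_3^2)/\epsilon^2)^{1/2}. \tag{36}$$

This is indeed possible because the variability domain of the $p_k$ is
unrestricted and the above form of the infinitesimal elements can be obtained
by unitary transformation of the operators given in (28a). Such a
transformation leaves the $I_{kl}$ of (28b) unchanged but transforms the $I_{k0}$ of
(28c) into

$$I_{k0} = -\epsilon(P + (p_1^2 + p_2^2 + p_3^2)/\epsilon^2)^{1/2}\,\partial/\partial p_k. \tag{36b}$$

Hence we shall have

$$J_0 = \lim\left[-i(P + (p_1^2 + p_2^2 + p_3^2)/\epsilon^2)^{1/2} + iP^{1/2}\right] = -(i/2m)(p_1^2 + p_2^2 + p_3^2), \qquad J_{kl} = p_l\,\partial/\partial p_k - p_k\,\partial/\partial p_l \tag{37.1}$$

$$J_k = \lim \epsilon(ip_k/\epsilon) = ip_k, \qquad J_{k0} = \lim -\epsilon^2(P + (p_1^2 + p_2^2 + p_3^2)/\epsilon^2)^{1/2}\,\partial/\partial p_k = \lim -(P\epsilon^4 + \epsilon^2(p_1^2 + p_2^2 + p_3^2))^{1/2}\,\partial/\partial p_k = -m\,\partial/\partial p_k. \tag{37.2}$$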


The reader familiar with the transition from the Klein-Gordon to the
Schrödinger equation will recognize the increase of the rest mass with
increasing $c$ and the elimination of this rest mass by the subtraction of
$-iP^{1/2}1$ from the infinitesimal operator of the time displacement.
The infinitesimal operators (37.1), (37.2) for the contracted group are in
fact those of Schrödinger's theory. It is likely that a similar contraction
is possible also for the other representations with positive rest mass (i.e.,
$P > 0$) but this and the behavior of the representations with $P = 0$ will
not be further discussed here.
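
The subtraction of the rest mass term can be illustrated with plain numbers.
The sketch below assumes the reading that $P^{1/2}\epsilon^2$ is held equal to a
constant $m$, and evaluates $(P + p^2/\epsilon^2)^{1/2} - P^{1/2}$ for a given value
of $p^2 = p_1^2 + p_2^2 + p_3^2$; as $\epsilon$ decreases this approaches the
non-relativistic kinetic term $p^2/2m$. The function name is hypothetical, and
the difference of square roots is rewritten to avoid loss of precision.

```python
import math


def shifted_energy(p_squared, m, eps):
    """sqrt(P + p^2/eps^2) - sqrt(P) with sqrt(P) = m / eps^2.

    Uses a - b = (a^2 - b^2) / (a + b) so that the large rest term cancels
    analytically rather than numerically.
    """
    root_p = m / eps ** 2
    scaled = p_squared / eps ** 2
    return scaled / (math.sqrt(root_p ** 2 + scaled) + root_p)


if __name__ == "__main__":
    m, p_squared = 2.0, 3.0
    for eps in (1.0, 0.1, 0.001):
        print(eps, shifted_energy(p_squared, m, eps), p_squared / (2.0 * m))
```

The operator $J_0$ of the text is $-i$ times this quantity, so the printed
values correspond to the magnitude of the limit $-(i/2m)(p_1^2 + p_2^2 + p_3^2)$.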


¹ Segal, I. E., Duke Math. J., 18, 221 (1951).

² Cf. e.g. Wigner, E., Gruppentheorie und ihre Anwendungen etc., Friedr. Vieweg,
Braunschweig (1931) and Edwards Brothers, Ann Arbor (1944). Chapter XV.

³ Cf. Jahnke, E., and Emde, F., Tables of Functions, Dover Publications, 1948, p. 149;
or Watson, G. N., Treatise on Bessel Functions, Cambridge Univ. Press, 1922, p. 19ff.

⁴ Reference 2, Chapter XIX, particularly pp. 230, 232. Cf. also Wigner, E. P., J.
Franklin Inst., 250, 477 (1950), and Godement, R., Trans. Amer. Math. Soc., 73, 496
(1952).

⁵ For $m = 0$ this is given on p. 65 of Watson's Bessel Functions (ref. 3).

⁶ Cf. e.g. Bargmann, V., and Wigner, E. P., these PROCEEDINGS, 34, 211 (1948) and
further literature quoted there.

⁷ Inonu, E., and Wigner, E. P., Nuovo cimento, 9, 705 (1952).





VOL. 39, 1953 MATHEMATICS: KALLIANPUR AND ROBBINS


ERGODIC PROPERTY OF THE BROWNIAN MOTION PROCESS
By G. KALLIANPUR AND H. ROBBINS*


THE INSTITUTE FOR ADVANCED STUDY AND UNIVERSITY OF NORTH CAROLINA


Communicated by J. von Neumann, April 16, 1953


1. Introduction. In a previous paper ("On the Equidistribution of 
Sums of Independent Random Variables," to appear elsewhere; an abstract 
will appear in Bull. Am. Math. Soc., 59 (May, 1953); we shall refer to this 
paper as [1]) we considered some properties of the sequence $S_n$ of partial 
sums of independent and identically distributed random variables or 
random vectors in two dimensions. Here we show in Theorems 1-4 
that the results of [1] carry over to the Brownian motion process in one 
and two dimensions. 

2. The Two-Dimensional Case. Let $X(t)$ denote the Brownian motion 
(Wiener) process on the line: $X(0) = 0$, $X(t)$ is continuous for all $t$ with 
probability 1, and for any $t_0 < t_1 < \cdots < t_n$ the random variables 
$X(t_j) - X(t_{j-1})$, $j = 1, \ldots, n$, are independent and normally distributed 
with zero means and variances $t_j - t_{j-1}$. Let $V(t) = (X(t), Y(t))$ denote the 
Brownian motion process in the plane, the two components of $V(t)$ being 
independent one-dimensional Brownian motion processes. Suppose that 
$f(x, y)$, $g(x, y)$ are real valued functions which are bounded and summable 
in the plane $-\infty < x < \infty$, $-\infty < y < \infty$, and set 
$\bar f = \int\!\!\int f(x, y)\,dx\,dy$, $\bar g = \int\!\!\int g(x, y)\,dx\,dy$, where here and 
in the sequel an integral sign without limits denotes integration over 
$(-\infty, \infty)$. We shall prove the following two theorems for plane Brownian 
motion. The corresponding results for the one-dimensional case involve no 
essentially new arguments and will be stated without proof at the end of 
the paper. 

THEOREM 1. If $\bar f \ne 0$ then for every $u$, 

$$\lim_{T\to\infty} \Pr\left[\frac{2\pi}{\bar f \log T}\int_0^T f(V(t))\,dt < u\right] = G(u), \qquad (2.1)$$

where $G(u) = 1 - e^{-u}$ for $u \ge 0$, $0$ for $u < 0$. 

THEOREM 2. If $\bar g \ne 0$ then 

$$\lim_{T\to\infty} \frac{\int_0^T f(V(t))\,dt}{\int_0^T g(V(t))\,dt} = \frac{\bar f}{\bar g} \quad\text{in probability.} \qquad (2.2)$$

Proofs of Theorems 1 and 2: Assume $\bar f \ne 0$ and $\bar g \ne 0$ and define 

$$Z_n(f) = \frac{2\pi}{\bar f \log n}\int_0^n f(V(t))\,dt, \qquad W_n(f) = \frac{2\pi}{\bar f \log n}\sum_{j=1}^{n} f(V(j)),$$

with corresponding definitions for $Z_n(g)$ and $W_n(g)$. For any positive 
integer $n$, each component of $V(n)$ is the sum of $n$ independent random 
variables, each distributed normally with zero mean and unit variance, 
since 

$$X(n) = \sum_{j=1}^{n} [X(j) - X(j-1)], \qquad Y(n) = \sum_{j=1}^{n} [Y(j) - Y(j-1)].$$
A theorem in [1] gives 

$$\lim_{n\to\infty} \Pr[W_n(f) < u] = G(u). \qquad (2.3)$$

We shall later prove as Lemma 1 that 

$$\lim_{n\to\infty} E[Z_n(f) - W_n(f)]^2 = 0. \qquad (2.4)$$

It follows from (2.4) that $Z_n(f) - W_n(f)$ tends to zero in probability, 
and hence from (2.3) that $Z_n(f)$ has the same limiting distribution, $G(u)$, 
as $W_n(f)$. This proves (2.1) as $T \to \infty$ through integer values, and the 
extension to arbitrary $T$ is immediate, proving Theorem 1. 
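To prove Theorem 2 we shall later prove as Lemma 2 that 

$$\lim_{n\to\infty} E[Z_n(f)Z_n(g) - W_n(f)W_n(g)] = 0, \qquad (2.5)$$

and we make use of the fact, proved in [1], that 

$$\lim_{n\to\infty} EW_n^2(f) = \lim_{n\to\infty} EW_n^2(g) = \lim_{n\to\infty} EW_n(f)W_n(g) = \int_0^\infty u^2\,dG(u) = m_2, \text{ say.} \qquad (2.6)$$

From (2.4)-(2.6) it follows that 

$$\lim_{n\to\infty} EZ_n^2(f) = \lim_{n\to\infty} EZ_n^2(g) = \lim_{n\to\infty} EZ_n(f)Z_n(g) = m_2,$$

and hence 

$$\lim_{n\to\infty} E[Z_n(f) - Z_n(g)]^2 = 0. \qquad (2.7)$$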
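
The constant $m_2$ is the second moment of the limiting law $G(u) = 1 - e^{-u}$, which can be evaluated in closed form as 2. The sketch below only confirms that value by a midpoint-rule quadrature; the function name, cutoff, and step count are illustrative choices and not part of the argument.

```python
import math


def second_moment_of_G(upper=60.0, steps=200_000):
    """Midpoint-rule estimate of the integral of u^2 dG(u), where dG(u) = exp(-u) du on u >= 0."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        total += u * u * math.exp(-u) * h
    return total


if __name__ == "__main__":
    print(second_moment_of_G())  # close to 2
```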


As in [1], given $\epsilon > 0$ we choose $\delta = \delta(\epsilon) > 0$ such that $\delta < \epsilon$, $G(\delta) < \epsilon/2$, 
and $N = N(\epsilon)$ (by (2.1) with $f$ replaced by $g$) such that 

$$n > N \text{ implies } \Pr[Z_n(g) > \delta] > 1 - G(\delta) - \epsilon/2 > 1 - \epsilon.$$

Since 

$$\Pr[\{Z_n(f) - Z_n(g)\}^2 < \delta^4] \ge 1 - \frac{E[Z_n(f) - Z_n(g)]^2}{\delta^4},$$

if we choose (by (2.7)) $K = K(\epsilon)$ such that 

$$n > K \text{ implies } E[Z_n(f) - Z_n(g)]^2 < \epsilon\delta^4,$$

then if $n > \max(N, K)$, 

$$\Pr[Z_n(g) > \delta] > 1 - \epsilon, \qquad \Pr[\{Z_n(f) - Z_n(g)\}^2 < \delta^4] > 1 - \epsilon,$$

which in turn imply that 

$$\Pr\left[\left|\frac{Z_n(f)}{Z_n(g)} - 1\right| < \epsilon\right] \ge \Pr[Z_n(g) > \delta,\ \{Z_n(f) - Z_n(g)\}^2 < \delta^4] > 1 - 2\epsilon.$$

Since $\epsilon$ was arbitrary this proves (2.2) as $T \to \infty$ through integer values. 
Again, the extension to arbitrary $T$ is immediate. Finally, the restriction 
that $\bar f \ne 0$ can be dropped by a simple argument and the proof of Theorem 
2 is complete. 


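3. Proof of the Lemmas. We shall prove Lemma 2 first. We have 

$$Z_n(f) = \frac{2\pi}{\bar f \log n}\sum_{j=1}^{n}\int_{j-1}^{j} f(V(t))\,dt, \qquad W_n(f) = \frac{2\pi}{\bar f \log n}\sum_{j=1}^{n}\int_{j-1}^{j} f(V(j))\,dt.$$

Set 

$$D_n = E[Z_n(f)Z_n(g) - W_n(f)W_n(g)] = \frac{4\pi^2}{\bar f\,\bar g\,(\log n)^2}\sum_{k=1}^{n}\sum_{j=1}^{n}\int_{k-1}^{k}\int_{j-1}^{j} E[f(V(t))g(V(u)) - f(V(j))g(V(k))]\,dt\,du = \frac{4\pi^2}{\bar f\,\bar g\,(\log n)^2}\sum_{j,k=1}^{n} a_{jk},$$

where 

$$a_{jk} = \int\!\!\int_{R_{jk}} [\theta(t, u) - \theta(j, k)]\,du\,dt, \qquad R_{jk} = \{j - 1 < t < j;\ k - 1 < u < k\}, \qquad \theta(t, u) = E[f(V(t))g(V(u))].$$

We want to show that $D_n \to 0$ as $n \to \infty$, i.e., that 

$$\lim_{n\to\infty} (\log n)^{-2}\sum_{j,k=1}^{n} a_{jk} = 0. \qquad (3.1)$$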


To evaluate $\theta(t, u)$ we observe that for $t_1 < t_2$ the random vector $V(t_2) - V(t_1) = (X(t_2) - X(t_1), Y(t_2) - Y(t_1))$ has the joint probability density 

$$\frac{1}{2\pi(t_2 - t_1)}\exp\left[-\frac{x^2 + y^2}{2(t_2 - t_1)}\right].$$

Hence for $0 < t < u$, 

$$\theta(t, u) = \frac{1}{4\pi^2 t(u - t)}\int\!\!\int\!\!\int\!\!\int f(x, y)\,g(x + x', y + y')\exp\left[-\frac{x^2 + y^2}{2t} - \frac{x'^2 + y'^2}{2(u - t)}\right]dx\,dy\,dx'\,dy' = \frac{1}{4\pi^2 t(u - t)}\int\!\!\int\!\!\int\!\!\int f(x, y)\,g(\xi, \eta)\,e^{-Q(t, u)}\,dx\,dy\,d\xi\,d\eta,$$

where we have set 

$$Q(t, u) = \frac{1}{2}\left[\frac{x^2 + y^2}{t} + \frac{(x - \xi)^2 + (y - \eta)^2}{u - t}\right].$$
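In what follows $C$ will denote any constant whose numerical value is 
immaterial, and $F = \sup|f(x, y)|$, $G = \sup|g(x, y)|$. Then if $(t, u) \in R_{jj}$ 
or $\in R_{j,j+1}$ $(j \ge 2)$, 

$$|\theta(t, u)| \le \frac{G}{4\pi^2 t(u - t)}\int\!\!\int |f(x, y)|\,e^{-(x^2 + y^2)/2t}\,dx\,dy \cdot \int\!\!\int e^{-(x'^2 + y'^2)/2(u - t)}\,dx'\,dy' \le \frac{G}{2\pi t}\int\!\!\int |f(x, y)|\,dx\,dy \le \frac{C}{j - 1}. \qquad (3.2)$$

Also, for $0 < t < u$, 

$$|\theta(t, u)| \le \frac{1}{4\pi^2 t(u - t)}\int\!\!\int\!\!\int\!\!\int |f(x, y)\,g(\xi, \eta)|\,e^{-(x^2 + y^2)/2t}\,dx\,dy\,d\xi\,d\eta \le \frac{F}{2\pi(u - t)}\int\!\!\int |g(\xi, \eta)|\,d\xi\,d\eta \le \frac{C}{u - t}. \qquad (3.3)$$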


n 
We write 5° a, as the sui of the following terms: 
1 


n 


n 
(b) » Qn; (b’) =. iy, 
3 


k 3 


Ug 4ioks (d) : Cl jks (d’) 
2 25 jSk—-2S5n-2 2: 
(e) ay + (ye + (>. 

To prove (3.1) it will suffice to show that each of (a), ..., (e) 
is $o[(\log n)^2]$. 


C 
(a) From (3.2), if (¢, w)eR,;,7 > 2, | O(¢, w)| < - 4 
Y ee 


7 an oa C 
Also, | 4(j, J) : | ] f(x, y)g(x, ye a -GxOy S = : 
2rjiJ J = 1] 


n 
Hence > ts a4; S 


j=2 


(b) From (3.3), if (t, WeRy,, k > 3, 


= (\(log n). 


A(t, u)| < 


Also, | A(1, k) 







" 


Hence +s ay |< C- = O(iog n). 


(c) Again from (3.2) 
O(log 7). (3.6) 


(d) If (¢, ib aka - 2, then Q(/, u) > O and 


A(t, uw) — O(7,k) = =f Sn (x, v)ge(¢, n) fe~e"™ - 
: bar?t(u ; 
QO(),k)) I 
0G dx. . dy + — : x 
br? Lt(u t) qk DD 
dx d ° 


V0 7,k) 


fixwele, ne 
Setting 


RR. = tix) Sa (eb Say LE <a. taps ab R, complement of R,, 
let ¢ > 0 be arbitrary and choose a > 0 such that 


S P f(x, wel, n)| dx. ..dn < 


R, 


We can write 


. 7 


| ; 
Pon © es 
te*t(u — t). Ris bar=t(u 


where 


€ 


bart (ud t) (7 


To obtain a bound for J; we observe that 


Qt, uw) 


| )< 
k — j/~ 







Hence 
| PO e {OU.u) ~QC7,k) | aya 


Ol j,k) A 
oa 1} < max. fe 


j if \fes vette QU.) | p HOU m) 01,4) _ 
- t) Ra 
FG z 3 1 
dns ye : (e° — 1) dyv...dy 
1e?2(7 — 1L)(kR —j — 1) Ra 


se , h¢ . 
Since e* < xe’ for 0< x < b, we have in R, 


[ a? , 4a? ] 
(j—1)9 (k—j—1) (hk ) 
1< Ae Nl< 


~ 


Therefore 





and hence 


Cla) l 4 ] | (3.8 
, 2.0) 
(j-1)(R-j- j-1lj (R-j— 1)(k —7) 


Turning to Js, we have 


| C¢ (Rk — j) — tu — bt) 


jk —j)| t(u — t)j(k — j) 
i(k — j) — tu — 1)} 
(7 — lk — 7 — I)(R — 7) 
Since 1 2j = j(k — 7) —jkR+ (FG — 1I?*S He —|J — tu —HNS HE — 
D-G-VR-N+P=kRAI-1 
Wk — j) — tu - t)| < max. [k +7 -— 1,27 — 1] = 
+s 4 ee Ef 1) 4 ~ 1) + FG. 
Hence 
+ + 
i(k —7) j(R-—7 — I)(R — 7) 


l l 
~ og Sea : = 
j—1)(k =| ir — 1)*(k —j — 1) 


l l 
. : 5 aera? - - t+ (3:9) 
(7 — 1)(R —37 — 1)? (7 - 17k -j - =| 


From (3.7)—(3.9) we obtain for 2<j< k — 2< 







Ce 
"(j- DG 


Now from || 


while 


0 (log 7), 


(3.12) 


0 (log n). 


From (3.4)-(3.6) and (3.10)-(3.13) it follows that 

$$\limsup_{n\to\infty}\,(\log n)^{-2}\sum_{j,k=1}^{n} a_{jk} < C\epsilon,$$

and since $\epsilon$ is arbitrary this proves (3.1) and Lemma 2. 


Turning to the proof of Lemma 1, let 


K. = | fi V(t)) dt _ ry fi vi) | = 
log n LJo joi 
| n we 
2 f(V(t)) — f(V(j)) | dt. 
log m j=] Jj-1 } 


n 
Then EK;, = (logn)-*» SY) dy, 
1, k ! 
where 


bis 


SS ELFVM) 
Ryk 


-AVIPIUS Va) — f(V(R))) dtdu 


SS (hot, u) — 0(t, k)| — (007, w) 


- 0(j, k)}] dtdu 
Rjh 







and where we iow write 


A(t, u) Ef( V(t f(V(a)) 


| . 
1h ze 


For 2<j<h- 


1 or °t( 


for 0 < u. 2<n-2 


a] 


O(t, u) — Of, R) 


hart (U4 


k) 
| dx In + _ 
- A — marie & 
(C, ne 


aa 


In Ry, 


O(t, u) — Olt, k) 


set 


(y — )? 


Vt, wu) fOU, u)-QOtt 
Hence |e e 


Vil, u 


(x 
u) — OL, RIS 


Vu, k)} 


e 


< Olt, 


Thus, writing 


bar*t(u 


- 1)(k 


(j —]j 1) 


wh 


dx 


Sr°t(u — 


k—u 


Bais 
ii —-t)(k—t)~ ( 


(u 
- 
ni 7 = 1) 


Ge Gj 


Hence we have 




Ot, k) 


re + (y — »)? 




[ se wife, nie 
7 5 f(x, wife, nle7 PO ” 


pe f(x, vf X 


Jit Jo. 


adx...dn = 


of 
k—it} 


k—u 
> 0 
u — t)(k — t) 


a 1) <1 


o)? + (y — n)? 


») 


k—u 
(u — t)(k — bt) 


So ot aa 


“ae Ce 
we have J, < ’ 


| X 


C(a) 


jJ-lM(k-j- 13 













m Cc 
A(t, u) — Ot, k)| < — es + 
(7 — 1)(kR —7 — 1) 
C(a) . € 


; ; are : (3.14) 
(j — 1)(k —7 -— 1) (7 — 1)(k —7 — 1)? 


and similarly it can be shown that the right-hand side of (3.14) is an upper 
bound for $|\theta(j, u) - \theta(j, k)|$. Hence, as in the proof of Lemma 2, 

$$\lim_{n\to\infty} (\log n)^{-2}\sum_{2 \le j \le k-2 \le n-2} b_{jk} = 0,$$

and the other sums occurring in $EK_n^2$ can be proved to be $o[(\log n)^2]$ as 
before, completing the proof of Lemma 1. 

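4. The One-Dimensional Case. Let $f(x)$ and $g(x)$ be real valued 
functions which are bounded and summable in the line $-\infty < x < \infty$, and set 
$\bar f = \int f(x)\,dx$, $\bar g = \int g(x)\,dx$. 

THEOREM 3. If $\bar f \ne 0$ then for every $u$, 

$$\lim_{T\to\infty} \Pr\left[\frac{1}{\bar f\sqrt{T}}\int_0^T f(X(t))\,dt < u\right] = H(u),$$

where 

$$H(u) = \left(\frac{2}{\pi}\right)^{1/2}\int_0^u e^{-y^2/2}\,dy \text{ for } u \ge 0, \qquad H(u) = 0 \text{ for } u < 0.$$

THEOREM 4. If $\bar g \ne 0$ then 

$$\lim_{T\to\infty} \frac{\int_0^T f(X(t))\,dt}{\int_0^T g(X(t))\,dt} = \frac{\bar f}{\bar g} \quad\text{in probability.}$$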

* John Simon Guggenheim Memorial Fellow. 


OMNIBUS CHECKING OF THE 61-PLACE TABLE OF DENARY 
LOGARITHMS COMPILED BY PETERS AND STEIN, BY CALLET, 
AND BY PARKHURST 
By HORACE S. UHLER 
YALE UNIVERSITY 
Communicated by J. B. Whitehead, March 30, 1953 

The first reference is: Zehnstellige Logarithmentafel. Erster Band. 
Herausgegeben von Reichsamt für Landesaufnahme unter wissenschaftlicher 
Leitung von Prof. Dr. J. Peters. Berlin 1922. Table 14b, pages 156-162 of 
the appendix. The original source for Table 14b is acknowledged on page 
xix by the statement that "... this table contains the 61-place common 
logarithms of all the integers from 1 to 100 and above 100 of the primes 
alone up to 1097 according to the data of Callet (Tables portatives de 
logarithmes, Paris 1795, an III), ...." 

The word "omnibus" in the title is intended to indicate that groups of 
one hundred or more logarithms were added and the sum compared with 
the logarithms of certain factorials conformably to the obvious relation 

$$\log[(n)(n - 1)(n - 2)\cdots(k + 2)(k + 1)] = \sum_{j=k+1}^{n}\log j = a_1\log 2 + a_2\log 3 + \cdots + a_{l-1}\log l = \log(n!/k!) = \log(n!) - \log(k!),$$

which may be conveniently solved for $\log(n!)$. The logarithms of the 
factorials were obtained from Stirling's asymptotic series 

$$\log\Gamma(x) = (\log 2\pi)/2 + (x - \tfrac{1}{2})\log x - x + \sum_{m=1}(c_m/x^{2m-1}) + R, \qquad c_m = (-1)^{m-1}B_m/[(2m - 1)(2m)],$$

and $\Gamma(x + 1) = x!$ for $x$ integral. It is here assumed that, in a carefully 
computed table, the probability of false digits cancelling in the sum of the 
logarithms is negligible. Emphasis should be placed on the fact that this 
investigation pertains primarily to internal errors and not to terminal 
figures. From 101 to 1097 the work was complicated by the absence from 
Table 14b of all composite numbers since the ordinary factorials contain 
consecutive numbers regardless of their prime or composite character. 
Again since the logarithms under test are on the base 10 while those 
computed by the author are for the most part on the base $e$ the arithmetical 
labor was increased by the necessity of multiplying by the modulus $\log_{10} e$. 
The numerical data next presented in detail should clarify any inadequacy 
in the preceding outline of procedure. 
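
The comparison described above, a factorial logarithm from Stirling's series against a sum of individual logarithms, can be sketched with Python's standard `decimal` and `fractions` modules. This is an illustration under stated assumptions, not the author's procedure: the function names are hypothetical, the Bernoulli numbers follow the modern signed convention (so $B_{2m}$ absorbs the factor $(-1)^{m-1}$), and the working precision of 80 digits and the cutoff of 20 series terms are arbitrary choices.

```python
from decimal import Decimal, getcontext
from fractions import Fraction
from math import comb

getcontext().prec = 80  # working precision, comfortably beyond 61 places


def bernoulli(count):
    """Return [B_0, ..., B_count] as exact fractions (convention B_1 = -1/2)."""
    B = [Fraction(1)]
    for m in range(1, count + 1):
        acc = sum(comb(m + 1, k) * B[k] for k in range(m))
        B.append(-acc / (m + 1))
    return B


def pi_decimal():
    """pi to the current precision (series given in the decimal module documentation)."""
    getcontext().prec += 2
    lasts, t, s, n, na, d, da = 0, Decimal(3), 3, 1, 0, 0, 24
    while s != lasts:
        lasts = s
        n, na = n + na, na + 8
        d, da = d + da, da + 32
        t = (t * n) / d
        s += t
    getcontext().prec -= 2
    return +s


def ln_gamma_stirling(x, terms=20):
    """Natural log of Gamma(x) from Stirling's series, truncated after `terms` corrections."""
    x = Decimal(x)
    B = bernoulli(2 * terms)
    total = (x - Decimal(1) / 2) * x.ln() - x + (2 * pi_decimal()).ln() / 2
    for m in range(1, terms + 1):
        c = B[2 * m] / (2 * m * (2 * m - 1))
        total += Decimal(c.numerator) / Decimal(c.denominator) / x ** (2 * m - 1)
    return total


def log10_factorial_stirling(n):
    """Common logarithm of n! via Gamma(n + 1) and the modulus 1/ln 10."""
    return ln_gamma_stirling(n + 1) / Decimal(10).ln()


def log10_factorial_direct(n):
    """Common logarithm of n! as a plain sum of logarithms (the 'omnibus' side)."""
    return sum(Decimal(j).ln() for j in range(2, n + 1)) / Decimal(10).ln()


if __name__ == "__main__":
    print(log10_factorial_stirling(900))
    print(log10_factorial_direct(900))
```

If the two printed values agree to the working precision, no single logarithm in the summed block can carry an uncancelled error, which is the logic of the omnibus check.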

We shall now consider only one typical case, namely, the calculations 
involved in the discovery of the error in log 839. For convenience in 
applying Stirling's series the boundary numbers 800 and 900 were chosen 
for one section of Table 14b. All of the natural numbers beginning with 
801 and ending with 900 comprise the prime factors in the following product: 
$2^{99}\cdot 3^{52}\cdot 5^{25}\cdot 7^{16}\cdot 11^{10}\cdot 13^{9}\cdot 17^{6}\cdot 19^{5}\cdot 23^{5}\cdot 29^{5}\cdot 31^{4}\cdot 37^{3}\cdot 41^{2}\cdot 43^{2}\cdot 47^{2}\cdot 53\cdot 59^{2}\cdot 61\cdot 67^{2}\cdot 71\cdot 73^{2}\cdot 79\cdot 83\cdot 89^{2}\cdot 97\cdot 101\cdot 103\cdot 107\cdot 109\cdot 127\cdot 137\cdot 139\cdot 149\cdot 163\cdot 167\cdot 173\cdot 179\cdot 211\cdot 223\cdot 269\cdot 271\cdot 277\cdot 281\cdot 283\cdot 293\cdot 401\cdot 409\cdot 419\cdot 421\cdot 431\cdot 433\cdot 439\cdot 443\cdot 449$. 
Curiosity calls attention to the fact that the following 33 primes in the 
interval did not occur, namely, 113, 131, 151, 
157, 181, 191, 193, 197, 199, 227, 229, 233, 239, 241, 251, 257, 263, and all 
three digit primes whose common leading figure was 3. Primes greater 
than 449, such as 457, 463, ..., are not admissible because each exceeds 
one-half of 900. By having Table 14b photostated and then slicing the 
negatives into narrow strips, accuracy was increased and time was saved 
either by juxtaposition or by machine adding of selected strips arranged 
under and flattened by a sheet of plate glass. In calculating log(900!) 
from Table 14b it was necessary to borrow from this table the logarithms 
of numbers greater than 127 because the author's own tables did not include 
the desired data. This procedure was quite justifiable because in the 
earlier part of the present investigation it was shown that only one interior 
error existed in Table 14b for entries less than log 457. This error falls in 
the 58th decimal place of the mantissa of log 227 where the 12th pentad is 
printed as "49465" instead of the correct group 49565. Stirling's series 
gave log(G00!) = 2269. 82947 GISBS 14835 45315 15825 70310 51541 
{8578 90591 62162 01337 59483 25989---. The value of this logarithm as 
obtained from Table 14b agreed perfectly with the number just recorded 
through the 57th decimal place. But the digits beginning with the 56th 
place were 59582 7. Hence the excess 00099 4 of the tabular sum over the 
true value indicated an error of one unit in the 58th place in some one of the 
printed logarithms of the 15 primes 809, 811, 821, 823, 827, 829, 839, 853, 
857, 859, 863, 877, 881, 883, and 887. The author's zero experience with 
games of chance caused him to calculate at random the logarithms of 809, 
827, 853, 857, 859, and 863 before the error of +1 was run down in log 839. 
The same innocence led to the scrambled calculation of the logarithms of 
1013, 1021, 1031, 1033, 1049, 1063, 1093, and 1097. Two errors were 
unveiled in log 1097. 
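
The prime-power decomposition of the block 801 to 900 follows from Legendre's formula for the exponent of a prime in a factorial. The sketch below recomputes it; the function names are hypothetical, and the code is offered only as a check on the bookkeeping, not as the method the author used.

```python
def primes_up_to(n):
    """Sieve of Eratosthenes: all primes not exceeding n, in increasing order."""
    sieve = [True] * (n + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(n ** 0.5) + 1):
        if sieve[i]:
            sieve[i * i :: i] = [False] * len(sieve[i * i :: i])
    return [i for i, flag in enumerate(sieve) if flag]


def exponent_in_factorial(n, p):
    """Legendre's formula: exponent of the prime p in n!."""
    e, q = 0, p
    while q <= n:
        e += n // q
        q *= p
    return e


def factor_block(low, high):
    """Prime exponents of (low + 1)(low + 2)...(high) = high!/low!."""
    return {
        p: exponent_in_factorial(high, p) - exponent_in_factorial(low, p)
        for p in primes_up_to(high)
    }


exps = factor_block(800, 900)
# Primes between 101 and 449 that divide no number from 801 to 900.
absent = [p for p, e in exps.items() if 100 < p <= 449 and e == 0]
# Primes above 449 that do occur: these are the primes lying inside the block itself.
primes_in_block = [p for p, e in exps.items() if p > 449 and e > 0]

if __name__ == "__main__":
    print({p: e for p, e in exps.items() if e > 0 and p <= 449})
    print(len(absent), absent)
    print(primes_in_block)
```

The primes above 449 that appear are exactly the 15 primes from 809 to 887 whose tabulated logarithms enter the sum directly, which is why the residual error had to lie in one of them.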

To recapitulate, all of the errors in Table 14b known to the author are 
collected in table 1. 


TABLE 1 
LOG OF    DEC. PLACE    FOR      READ 
  27         61         2        3 
 227         58         49465    49565 
 293                    3        1 
 839         58         53974    53874 
1009                    38238    38228 
1097         58         00041    00941 
1097                    8        7 

The author does not claim priority for the discovery of all of these errors 
but he does desire to stress the fact that the procedure explained in the 
earlier paragraphs definitively proves that no other non-cancelling interior 
errors exist in Table 14b. 

The extensions of the natural and denary logarithms beyond the 48th 
and 61st decimal places, respectively, as found in the present work, deserve 
recording. For the former see Tafel 13. In table 2 the last digits have 
been conventionally “rounded off.” 


Juxtaposition of the sliced photostatic negatives of the 61-place tables of 
Callet against Table 14b of Peters and Stein showed that the latter had 
copied the earlier table perfectly, that is, including errors, last digits, and 
every detail. This comparison indicates that Peters and Stein did not 
fulfill the promises made or attain the standard of accuracy proposed in the 
following translation from page III of the appendix. "While examining 
the pertinent literature it appeared to us that the published data were not 
always correct and they afforded no guarantee that the accuracy of one half 
a unit in the last decimal figure as aspired to by us was actually attained. 
Accordingly as a matter of principle we have calculated anew all of the 
numbers presented in the following pages. Even in the few cases in which 
the recalculation of the published data as far as the last decimal would have 
required an inordinate amount of time so that it has been omitted, such as 
for example the constant π with 707 places, nevertheless the values have 
been recalculated at least in their initial figures (some 50 to 60 decimals) 
and the remaining decimals have been transcribed from the respective 
authors after the application of appropriate checks." The following 
verbatim quotation seems to be self-incriminating. Page V, Section 3. 


TABLE 2 
N     LN N, 46TH PL. ONWARD     LOG N, 56TH PL. ONWARD 
293 89170 , 89475, 47248 , 42893 ,11916,277 38729 , 39794 , 53776 , 4460 
809 18300 , 59089 , 16663 ,95699 , 34535 ,089 02266 , IS182 ,60534 , 4009 
827 29126 ,45785 , 33904 , 60968 , 68762 ,048 57998 , 22870 , 51816 , 6934 
839 54281 , 75400 , 47882 ,99126, 14778 , 660 53874, 12199 385504 ,4051 
853 02716 , 27159 , 24195 ,03767 , 99674 ,090 38386 , 83091 , 45246 , 2237 
857 79717 , 94675, 50498 , 18461, 50815, 105 56694 ,64756 51171, 1988 
859 25879 , 46508 , 86537 , 58084 , 72940 , 592 60098 , 73672 , 29372 , 7937 
863 16602 , 38093 ,94780 , 30999 , 70284 , 132 02396 , 17110 ,47809 , 38550 
1013 71829 ,94572 ,08322 , 66959 ,06295 ,926 25895 , 37720 ,01904 ,5775 
1021 07072 ,O5161 , 50822 , 14128 ,60597 , 592 51893 ,638435, 12531, 1482 
1031 80558 , 68462 ,65744 , 35802 , 98414 ,342 30178 ,04759 ,00572 ,0830 
1033 79426 , 18876 , 27458 ,07365 , 89581 ,391 27216, 15291 , 12745 ,0294 
1049 28900 , 76540 , 21424 , 45926 , 19408 , 259 21217 ,27842 ,03186 ,9215 
1063 39965 , 32348 , 96201 , 63980 ,68427 , 869 26966 , 18299 ,06500 , 3304 
1093 03988 , 28952 ,06228 , 18694 , 22395 ,475 01056 , 14226 , 25516 , 3272 
1097 75652 ,65165, 11623 ,42347 ,77247 ,408 00941 , 74706 , 58168 , 38747 


"Mit Rücksicht auf die 61-stelligen Logarithmen bei Callet (hier Tafel 14b) 
haben wir sie 61-stellig gegeben und ihre Richtigkeit in gleicher Weise wie 
oben (s. u. Ziffer 1) durch wirksame Kontrollen (auch hinsichtlich der 
Endziffern) gesichert." The book by François Callet had the title Tables 
Portatives de Logarithmes: Paris 1795 (Tirage 1846). 

The author takes pleasure in gratefully acknowledging the temporary 
loan, as effected by his colleague Professor R. C. Archibald, of the volume 
containing the tables by Parkhurst. This booklet is the property of the 
library of Brown University. Comparison of the 61-place table with that 
of Peters and Stein showed the identical errors recorded above. Parkhurst 
doubtless copied Callet's data for he wrote in the table of contents "XVIII 
Logarithms to 61 decimals, from Cala, ..., p. 78." Much credit redounds 
to Parkhurst for having devised home-made euphonic English and for 
assembling some of his numerical data in the most inconvenient and 
confusing manner imaginable! 

For additional information the reader may consult the following 
references. 


1 Archibald, R. C., and Uhler, H. S., "Errors in the Tables of Peters and Stein in the 
Anhang to J. T. Peters Zehnstellige Logarithmentafeln, Vol. 1," Math. Tables and Other 
Aids to Computation, 1, No. 2, pp. 57-59, April, 1943. 

2 Uhler, H. S., "Errors in Parkhurst's 100-Place Tables of Logarithms," Math. Tables 
and Other Aids to Computation, 1, No. 4, pp. 121, 122, October, 1943. 

3 Archibald, R. C., "New Information Concerning Isaac Wolfram's Life and 
Calculations," Math. Tables and Other Aids to Computation, 4, No. 32, pp. 185-209, 
October, 1950. 


SIGNIFICANCE LEVELS FOR A SKEW DISTRIBUTION* 
By EpwIn B. WILSON 
OFFICE OF NAVAL RESEARCH, Boston 
Communicated March 30, 1953 


1. The Symmetric Distribution. The first time one meets the problem 
of significance at about the 0.05 level is when one first writes $x \pm 2\sigma$, or, 
if dealing with a rate construed as a probability, $p \pm 2(pq/n)^{1/2}$. The 
implication is that a statistical fluctuation from the value $x$ or $p$ will occur 
to an absolute amount of twice the standard deviation of $x$ or $p$ only about 
once in 20 times, or once in 40 times on either side. In a strictly normal 
distribution¹ the proper multiplier for the level 0.05 is 1.96, not 2. Various 
refinements up to Student's $t$-test are familiar. 
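
The difference between the multipliers 2 and 1.96 is easy to see from the two-sided tail area of the normal curve. The following sketch uses only the standard-library complementary error function; the function name is an illustrative choice.

```python
import math


def two_sided_tail(k):
    """Probability that a standard normal variate differs from 0 by more than k."""
    return math.erfc(k / math.sqrt(2.0))


if __name__ == "__main__":
    for k in (1.96, 2.0):
        p = two_sided_tail(k)
        print(k, p, 1.0 / p)  # tail area and the corresponding "once in N times"
```

A deviation of two standard deviations is exceeded about once in 22 times rather than once in 20, which is the sense in which 1.96 is the proper multiplier.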

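In the case of comparing two rates, $p_1$ and $p_2$, in samples of $n_1$ and $n_2$, 
the significance test is often put in the form 

$$\chi^2 = \frac{(p_1 - p_2)^2}{p_0 q_0\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)} > 3.84 \quad\text{or}\quad \frac{|p_1 - p_2|}{\sqrt{p_0 q_0\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}} > 1.96.$$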
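
A worked instance of the two-rate test may help. In the sketch below $p_0$ is taken to be the pooled rate of the two samples, which is an assumption about the notation rather than something stated in the text; the function name and the counts are invented for illustration.

```python
def chi_square_two_rates(x1, n1, x2, n2):
    """Chi-square (one degree of freedom) for the difference of two observed rates."""
    p1, p2 = x1 / n1, x2 / n2
    p0 = (x1 + x2) / (n1 + n2)  # pooled rate: an assumed reading of p0
    q0 = 1.0 - p0
    return (p1 - p2) ** 2 / (p0 * q0 * (1.0 / n1 + 1.0 / n2))


if __name__ == "__main__":
    chi2 = chi_square_two_rates(30, 100, 15, 100)
    print(chi2, chi2 > 3.84)  # exceeds 3.84, so significant at about the 0.05 level
```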


The $\chi^2$ distribution extends from 0 to $\infty$ and, for one degree of freedom, 
is the positive half of the normal distribution doubled, viz., 

$$\frac{2}{\sqrt{2\pi}}\,e^{-\chi^2/2}\,d\chi.$$
Of course, if $f(z)$ is the distribution of $z$ from $-\infty$ to $\infty$ about $z = 0$ and 
if $f(z) = f(-z)$, the distribution of the absolute value $|z|$ must be $2f(|z|)$ 
and run from 0 to $\infty$. The level of significance of a value $z$ is defined as 
the area under the distribution function $f(z)$ more remote from $z = 0$ than 
$\pm z$, viz.,² 

$$\int_{-\infty}^{-z} f(z)\,dz + \int_{z}^{\infty} f(z)\,dz = 2\int_{z}^{\infty} f(z)\,dz, \qquad z > 0.$$


It can have any value from 0, when the stated value z is at the ends of the 
distribution, to 1, when it is at the center. All this is quite familiar: 
there seems to be universal agreement both in theory and in practice in 
regard to it, even though the words used may differ from one to another. 

2. The Target Problem. If one considers the two-dimensional $\chi^2$ 
distribution in the form 

$$P(\chi^2 \ge r^2) = \int_r^\infty e^{-r^2/2}\,r\,dr = e^{-r^2/2}, \qquad (1)$$

which arises in the Bravais target problem for a circular pattern, the mode 
of the distribution in $r$ is at $r = 1$, the median at $r = (2\log_e 2)^{1/2} = 
1.177$, the mean at $(\pi/2)^{1/2} = 1.253$, and the root mean square at 
$r = \sqrt{2} = 1.414$, this last being the radius of gyration of the hits. If one 
wishes to estimate the bad shots in terms of the probability of a deviation 
so great from the bull's-eye $r = 0$ and at the usual significance level 0.05, 
one has only to solve $e^{-r^2/2} = 0.05$ for $r$ to find $r^2 = 5.991$ and $r = 2.447$. 
Although in the ballistic problem one would probably be interested in 
deviations from $r = 0$, one might have precisely the same distribution for 
$r$ in a problem in which the interest was in deviations from some specified 
value of $r$ such as $r = 1.414$. Indeed in the kinetic theory of gases it is the 
kinetic energy of the molecules which is of greatest interest and the 
distribution according to velocity is the three-dimensional counterpart of (1). 
And one might very well be asking about the molecules, say, the 5% of 
molecules, whose velocity departed most from the root mean square 
velocity which determines the mean kinetic energy, and thus the 
temperature. These aberrant molecules would be of two sorts, those with velocity 
well above and those with velocity well below the particular velocity 
specified, whether it be the root mean square, mean, median, mode, or 
any other. 
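
The constants quoted for distribution (1) follow directly from the survival function $e^{-r^2/2}$. The sketch below recomputes them; the variable and function names are illustrative.

```python
import math


def survival(r):
    """P(radius > r) for distribution (1)."""
    return math.exp(-r * r / 2.0)


def radius_exceeded_with_probability(p):
    """Solve exp(-r^2/2) = p for r."""
    return math.sqrt(-2.0 * math.log(p))


mode = 1.0                                       # maximum of the density r*exp(-r^2/2)
median = radius_exceeded_with_probability(0.5)   # (2 ln 2)^(1/2)
mean = math.sqrt(math.pi / 2.0)
root_mean_square = math.sqrt(2.0)
r_05 = radius_exceeded_with_probability(0.05)    # radius beyond which 5% of shots fall

if __name__ == "__main__":
    print(median, mean, root_mean_square, r_05 ** 2, r_05)
```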

To have a name for it, I shall call the specified value the target³ value; 
it may be any value within the range of possible values. If $T$ be the target 
value and $r_0 < T < r_1$, the condition for significance at the level $L$ is by 
definition 

$$\int_0^{r_0} e^{-r^2/2}\,r\,dr + \int_{r_1}^{\infty} e^{-r^2/2}\,r\,dr = L. \qquad (2)$$


3. Discussion of the Definition. If $L = 0$, the lower limit $r_0$ must be 
the lowest limit of the range of the distribution and the upper limit the 
uppermost; if $L = 1$, $r_0 = r_1 = T$. If the lowest limit of the distribution is 
finite and the target value coincides therewith, $r_0 = 0$ and $r_1$ is determined 
by (2); and similarly if the upper limit of the distribution were finite and $T$ 
coincided with it, $r_1$ would equal $T$ and $r_0$ would be fixed. In most other 
cases, $r_0$ and $r_1$ would either be indeterminate or must be fixed by some 
special convention to be adopted only after critical discussion of what is 
appropriate to the problem in hand. 

(a) For instance, if one is concerned with the actual absolute value of 
the difference $|r - T| = h$, one should take $r_0 = T - h$ and $r_1 = T + h$ for 
all values of $h$ which do not carry beyond the range of the frequency 
function with the understanding that if either $r_0$ or $r_1$ reaches one end of the 
range before the other, that other shall be let run on to its end of the range. 
This procedure seems so natural for the discussion of the chance variation 
of the number of deaths in a population subject to a low death rate that it 
is almost universally adopted. 

(b) Secondly, it is often the practice to split the level $L$ into two equal 
parts and apply each half to each tail of the frequency function. If this 
procedure is to be followed right through for all levels from 0 to 1, it is 
necessary that the target value be the median; otherwise one would apply 
it only until either $r_0$ or $r_1$ closed down on the target and would then put 
all the rest of the level into the other tail. 

(c) In the third place, one might use the condition $f(r_0) = f(r_1)$ in the 
usual unimodal frequency function with tails running to 0 at the ends of 
the range, so that the definition of significance would cover all cases less 
frequent than $f(r_0) = f(r_1)$. Here, if consistently carried through to $L = 1$, 
the target value must be the mode, or some adjustment must be made as 
in the two previous instances. An alternative statement of the condition 
would be to require the range $r_1 - r_0$ to be a minimum. 

(d) In the fourth place, one might divide $L$ in proportion to the areas 
under the frequency function from $T$ to the two ends of the range. Thus 
in the case of (1), if one were discussing significant deviations from $r = T = 
1.414$, the radius of gyration, which corresponds to the molecular velocity 
of the mean kinetic energy in the three-dimensional case of a gas, one would 
first find that the probability of a shot falling outside $r = 1.414$ is $1/e = 
0.368$ and would then determine $r_0$ and $r_1$ by the equations 

$$\int_0^{r_0} e^{-r^2/2}\,r\,dr = 0.632\,L, \qquad \int_{r_1}^{\infty} e^{-r^2/2}\,r\,dr = 0.368\,L,$$

so that $r_0 = 0.252$ and $r_1 = 2.83$, if $L = 0.05$. 
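
Convention (d) is simple to carry out for distribution (1), since its cumulative function inverts in closed form. The sketch below is a minimal illustration with hypothetical function names; it also shows convention (b) for comparison.

```python
import math


def cdf(r):
    """P(radius <= r) for distribution (1)."""
    return 1.0 - math.exp(-r * r / 2.0)


def quantile(p):
    """Inverse of cdf: the radius below which a fraction p of the shots fall."""
    return math.sqrt(-2.0 * math.log(1.0 - p))


def proportional_limits(target, level):
    """Convention (d): split the level in proportion to the areas on each side of the target."""
    below = cdf(target)
    above = 1.0 - below
    r0 = quantile(level * below)
    r1 = quantile(1.0 - level * above)
    return r0, r1


def equal_tail_limits(level):
    """Convention (b): half of the level in each tail."""
    return quantile(level / 2.0), quantile(1.0 - level / 2.0)


if __name__ == "__main__":
    print(proportional_limits(math.sqrt(2.0), 0.05))
    print(equal_tail_limits(0.05))
```

Evaluated in double precision, the proportional limits for $T = \sqrt{2}$ and $L = 0.05$ come out near 0.253 and 2.83, in line with the figures quoted in (d).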

All these four possibilities coalesce into a single one if the distribution is 
symmetrical; none seems to have been generally adopted; and there are 
still other possibilities. Of the four, the second and fourth are invariant 
under a change of variable, which is distinctly in their favor if and only if 
the variable is merely ordinal without any cardinal significance; and the 
fourth alone can be applied in general without modification all the way 
from $L = 0$ to $L = 1$. There is another property of the fourth which 
deserves noting, namely: the (relative) probability that $r$ be less than $r_0$ if 
less than $T$ and the (relative) probability that $r$ be greater than $r_1$ if greater 
than $T$, and the probability that $r$ lie outside the interval $r_0 < r < r_1$ are all 
the same. 

Whether this property is important may be judged by asking what would 
be the appropriate answer to the question: How to define significance 
level in case the only interest is in a part of the possible range of the 
variable? To be concrete, what should be the 0.05 significance level of wildness 
of shots outside the ring $r = 1.414$ around the bull's-eye? If we reply by 
taking an area 0.05 off the infinite end of the whole distribution we get 
$r = 2.45$; and we must get the same result if we had specified any other ring 
$r = r_1 \le 2.45$ instead of $r = 1.414$, so that from our specifications the 
"outside the ring $r = 1.414$ around the bull's-eye" might better have been 
omitted. The question as actually put might better be answered on the 
basis of relative probabilities by $r = 2.83$. 

4. Variance Analysis. The fundamental frequency function is 

$$\frac{n_1^{n_1/2}\,n_2^{n_2/2}}{B(n_1/2,\,n_2/2)}\,\frac{F^{(n_1-2)/2}}{(n_1F + n_2)^{(n_1+n_2)/2}}\,dF = \frac{2\,n_1^{n_1/2}\,n_2^{n_2/2}}{B(n_1/2,\,n_2/2)}\,\frac{e^{n_1 z}}{(n_1e^{2z} + n_2)^{(n_1+n_2)/2}}\,dz,$$

where $F$ is the ratio of two variances $s_1^2/s_2^2$ of $n_1$ and $n_2$ degrees of freedom, 
and $z$ is half its natural logarithm. The distribution of $z$ has a mode at 
$z = 0$, which corresponds to equality of the two observationally determined 
variances; but the $z$-distribution is skew⁴ unless $n_1 = n_2$. The question is: 
Is the ratio $F$ so large or so small that its observation implies that the 
assumption that the two samples came from universes with equal variances 
is untenable at a specified level $L$ of significance? 

The target value of $F$ is not 1, nor of $z$ the correlative value 0; it is taken 
to be the median, which is that value which $F$ (or $z$) is as likely as not to 
exceed if the samples come from universes of equal variances; and the 
significance level $L$ is divided equally between the two tails as in the second 
possibility listed above. For example, if one takes $L = 0.10$ and $n_1 = 1$, 
$n_2 = 2$, one finds $z_0 = -2.6479$ and $z_1 = 1.4592$ with corresponding values 
for $F$, as recorded in the standard published tables.⁵ The median is 
$-0.2028$ for $z$ and $2/3$ for $F$. 
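
For the particular case $n_1 = 1$, $n_2 = 2$ the tabulated values can be reproduced without tables. The sketch below relies on a fact not stated in the text: with these degrees of freedom $F$ is the square of Student's $t$ with 2 degrees of freedom, whose distribution gives $P(F \le x) = [x/(2 + x)]^{1/2}$. The function names are illustrative.

```python
import math


def f_quantile_1_2(p):
    """Value x with P(F <= x) = p for 1 and 2 degrees of freedom (closed form)."""
    return 2.0 * p * p / (1.0 - p * p)


def z_from_f(f):
    """Fisher's z: half the natural logarithm of the variance ratio."""
    return 0.5 * math.log(f)


z0 = z_from_f(f_quantile_1_2(0.05))   # lower limit for L = 0.10 split equally
z1 = z_from_f(f_quantile_1_2(0.95))   # upper limit
median_f = f_quantile_1_2(0.5)
median_z = z_from_f(median_f)

if __name__ == "__main__":
    print(z0, z1, median_f, median_z)
```

The median of $z$ comes out negative, since the median of $F$ is below 1.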

The $t$-test for the mean, being symmetrical, can be stated as a test for 
$F = t^2$, varying from 0 to $\infty$, and when so stated the frequency function 
becomes identical with that for the ratio of two variances $s_1^2$ with one 
degree of freedom and $s_2^2$ with $n$; but this does not mean that the $t$-test thus 
formulated is the same as a variance test,⁶ because for the former the 
target value is $T = 0$ whereas for the latter it is the median $F$. 











It is well known that the last row in the table for $z$, when $n_2 = \infty$, gives 
the value $z = \tfrac{1}{2}\log(\chi^2/n_1)$ for 0.05. This is as it should be because with 
$n_2 = \infty$ the value of $s_2^2$ must be $\sigma_2^2$, the variance of the universe, and the 
ratio $F$ then becomes simply $\chi^2/n_1$. The target value from which $\chi^2$ is 
normally measured is 0 and that of $z$ is correspondingly $-\infty$. This in itself 
is sufficient to show that the 0.05 table for $z$ does not give the values of $z$ at 
the 0.05 level of significance in variance analysis, for the target value of $z$ 
is then not $-\infty$;⁷ the 0.05 table for $z$ gives one measure of the 0.10 
significance level. This, I believe, many find confusing. 

5. A Bit of History.—The matter of the z distribution first came to the 
attention of a wide population of statisticians when the first edition of 
R. A. Fisher's Statistical Methods for Research Workers appeared in 1925. 
It was in Art. 41, pp. 192-193, that he wrote 


These tests are all reducible to the single problem of testing whether one estimate of 
variance derived from n, degrees of freedom is significantly greater than a second such 
estimate derived from m. degrees of freedom. This problem is reduced to its simplest 
form by calculating 2 equal to half the difference of the natural logarithms of the esti 
mates of the variance, or to the difference of the logarithms of the corresponding standard 
deviations. Then if ? is the probability of exceeding this value by chance, it is possible 
to calculate the value of 2 corresponding to different values of ?, ), and my 

When both , and mn are large, and also for moderate values when they are equal 
or nearly equal, the distribution of z is sufficiently nearly normal for effective use to be 


; es ; 1/1 l 
made of its standard deviation, which may be written \ ( + ) 
a\ 


2 nN» 


If we take $n_1 = n_2 = 24$, the standard deviation of $z$ is 0.2041 and 1.96
times this is 0.4001, which would correspond to the 0.05 significance level
in a normal curve; but the entry in the table is 0.3425, and this does in fact
correspond to the 0.10 level in the normal curve.⁸ The fact that Fisher's
rule for the standard deviation in his second paragraph does not give the
values in the table prepared in accord with his first, has perplexed a number
of students, and has apparently been overlooked by many others.⁹
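The arithmetic in the preceding paragraph can be checked with a minimal sketch. The function name is hypothetical; the formula is the standard deviation quoted from Fisher, and 1.645 is the usual normal deviate that leaves 0.10 in the two tails together.

```python
import math


def z_standard_deviation(n1: float, n2: float) -> float:
    """Large-sample standard deviation of z: sqrt((1/2)(1/n1 + 1/n2))."""
    return math.sqrt(0.5 * (1.0 / n1 + 1.0 / n2))


sigma_z = z_standard_deviation(24, 24)
limit_05 = 1.96 * sigma_z   # normal limit when 0.05 is split between both tails
limit_10 = 1.645 * sigma_z  # normal limit when 0.10 is split between both tails

print(round(sigma_z, 4), round(limit_05, 4), round(limit_10, 4))
```

The tabulated entry 0.3425 lies much nearer the second limit than the first, which is the point the text makes.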

One may use the ratio $R_1 = z/\sigma_z$ as a normal variate to estimate significance,
as Fisher says. In the old days before Fisher invented the
rigorous theory of variance analysis the manner of comparing $s_1$ and $s_2$ was
by a difference as was, and still is, the case for two means; this would lead
to the critical ratio

$$R_2 = \frac{s_1 - s_2}{\sqrt{\dfrac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2}\cdot\dfrac{1}{2}\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}}.$$

Like the ratio $R_1$, it would apply only when $n_1$ and $n_2$ were equal and
moderate or both fairly large. If recourse be had to the result that





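$$\left[\left(\frac{\chi^2}{n}\right)^{1/3} - \left(1 - \frac{2}{9n}\right)\right] \bigg/ \sqrt{\frac{2}{9n}}$$

is approximately a normal variate of unit variance over the range from
$-0.8$ to 2.4, even when $n$ is small,¹⁰ one might set up a third ratio with
some hope that it would give a good approximation over a considerable
range even for tolerably small numbers and for numbers which were
different, viz.:

$$R_3 = \frac{s_1^{2/3} - s_2^{2/3} + \dfrac{2}{9}\left(\dfrac{1}{n_1} - \dfrac{1}{n_2}\right)\left(\dfrac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2}\right)^{1/3}}{\left(\dfrac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2}\right)^{1/3}\sqrt{\dfrac{2}{9}\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)}}.$$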


To see how these three $R$'s compare at $P = 0.10$ when $n_1 = n_2$, one need
only substitute for $z$ in Fisher's table VI (labeled $P = 0.05$) or Kendall's
table 4, op. cit., p. 442, or any equivalent.¹¹ One finds:

n 3 = 6 ‘ n 24 
Rk, 2. 2.0! 1.98 78 1.678 
R, : 1.56 61 i 1 638 
R 4 1 657 659 ie 1 649 


The proper value for $R$ is 1.645. The values of $R_1$ are too large, those of $R_2$
too small, and those of $R_3$ sometimes too large and sometimes too small.
Whether one should compare the deviations or turn them into the corresponding
probabilities, and how to compare the probabilities, are questions
that would have to be discussed, and there might well be different opinions
as to the rigorously correct answers; but the display seems to indicate
fairly clearly that on the whole $R_1$ is the worst and $R_3$ is the best of the tests;
indeed for equal $n$'s, $R_3$ gives satisfactory results from $n = 2$ up, and the
old-fashioned $R_2$ from $n = 12$ up more satisfactory than $R_1$ gives for $n = 24$.
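The comparison of the three ratios can be reproduced numerically. The following is a minimal sketch, not the author's computation. It assumes that $R_1 = z/\sigma_z$, that $R_2$ divides $s_1 - s_2$ by the pooled standard deviation times $\sigma_z$, and that $R_3$ is built from the cube-root approximation with the same pooled variance $(n_1 s_1^2 + n_2 s_2^2)/(n_1 + n_2)$. The pooled weighting is a reconstruction from the damaged formulas, and the function name is hypothetical. Since all three ratios depend only on $s_1/s_2 = e^z$, the sketch sets $s_2 = 1$.

```python
import math


def critical_ratios(z: float, n1: float, n2: float):
    """Return (R1, R2, R3) for z = ln(s1/s2), taking s2 = 1."""
    s1, s2 = math.exp(z), 1.0
    # Pooled variance, weighted by degrees of freedom (assumed form).
    pooled_var = (n1 * s1 ** 2 + n2 * s2 ** 2) / (n1 + n2)
    sigma_z = math.sqrt(0.5 * (1.0 / n1 + 1.0 / n2))

    r1 = z / sigma_z
    r2 = (s1 - s2) / (math.sqrt(pooled_var) * sigma_z)

    cube = pooled_var ** (1.0 / 3.0)  # pooled s to the power 2/3
    shift = (2.0 / 9.0) * (1.0 / n1 - 1.0 / n2) * cube
    scale = cube * math.sqrt((2.0 / 9.0) * (1.0 / n1 + 1.0 / n2))
    r3 = (s1 ** (2.0 / 3.0) - s2 ** (2.0 / 3.0) + shift) / scale
    return r1, r2, r3


# Tabulated z for n1 = n2 = 24 in the table labeled P = 0.05.
r1, r2, r3 = critical_ratios(0.3425, 24, 24)
print(round(r1, 3), round(r2, 3), round(r3, 3))
```

For this entry the sketch gives values close to 1.678, 1.638, and 1.649 or 1.650, to be compared with the proper value 1.645.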
If next one applies $R_3$ to $n_1 = 2$ and a variety of $n_2$'s, one has:
Ny = 3 No = 4 Ne = & Mm, = 8 nN» 
R 1.676, —1.623 1.682, —1.607 1.682, —1.580 1.678, —1.578 1.659, 


It will be seen that the positive limit increases a bit and then decreases but
remains close to the proper value 1.645, whereas the negative limit falls off
steadily to about $-1.55$, and the trial of other cases seems to confirm this
behavior. If one goes over to $n_1 = 3$, $R_3$ for $n_2 = \infty$ is $-1.604$ and for
$n_2 = 12$ it is $-1.622$. In the vicinity of this point of the normal curve a drop of
0.01 in the abscissa brings a change to $P = 0.12$, or about 20%. It looks
safe to say that for $n_1$ and $n_2$ both greater than 3, $R_3$ will reproduce the
table for $P = 0.10$ with an accuracy of within about 5%, i.e., $P$ will lie
between 0.095 and 0.105.











VOL. 39, 1953 MATHEMATICS: E. B. WILSON 543


It should be noted that it is upon the negative side that the errors of $R_3$
are the more serious; this corresponds to the fact that the approximation to
$\chi^2$ is worse on the negative side, i.e., for values near 0. It is therefore to be
expected that if one desired to work at the significance level $P = 0.05$ one
would have to compute a table for that level up to the point where $R_3$ would
give good enough values or one would have to find an improved approximation
to $\chi^2$ and base a test upon that; or one would have to make some
adjustment in the expression for $R_3$ as given.¹²

Summary. The various desires of statisticians treating a great variety 
of problems will probably result in a considerable variety of definitions for 
significance. So long as there remain differences of definition care should 
be exercised to make sure that the definition adopted in any particular 
problem is appropriate to that problem and is made clear. 

It seems that there is involved in the concept of significance level some- 
thing I have called a target value of zero significance from which departures 
are measured on some sort of scale; they may, according to the problem, 
be measured in either direction or in both. 

In case there is no cardinal but only an ordinal meaning to deviations
from the target value, the frequency function $f(z)$, with its integral
distribution function $I$, where $dI = f(z)\,dz$, may be transformed to any variable.
In particular if $I$ is used as the variable, the new frequency function
becomes identically 1 over the range from 0 to 1, and should the variable
$H = 2(I - \tfrac{1}{2})$ be introduced, the function would be constant at $\tfrac{1}{2}$ over
the range $-1$ to $+1$. If the target value were the median, the frequency
function would then be symmetric with respect to its median and the
skewness of the original distribution may be regarded as accidental.¹³ If the
target value be not the median, the function $f(z)$ can still be transformed,
at least conceptually, to the uniform value $\tfrac{1}{2}$ over the range $-1$ to $+1$,
but as the target value will not be at the midpoint of the range, the original
distribution $f(z)$ taken with its target value $z = T$, has to be considered as
essentially asymmetric. This is the case which offers the most serious
liability to difference of definition of significance level between different
statisticians.
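The transformation to the uniform scale can be illustrated by a minimal sketch. It assumes the integral distribution function is written $I$ and the transformed variable is $H = 2(I - 1/2)$. The decision rule $|H| \ge 1 - L$ is an assumption covering only the case where the target value is the median, and both function names are hypothetical.

```python
def uniform_scale(cdf_value: float) -> float:
    """Map the integral distribution function I in [0, 1] to H = 2(I - 1/2) in [-1, 1]."""
    if not 0.0 <= cdf_value <= 1.0:
        raise ValueError("cdf_value must lie between 0 and 1")
    return 2.0 * (cdf_value - 0.5)


def significant_about_median(cdf_value: float, level: float) -> bool:
    """Assumed rule for a median target: significant at level L when |H| >= 1 - L."""
    return abs(uniform_scale(cdf_value)) >= 1.0 - level


# An observation at I = 0.683 sits at H = 0.366, i.e. on the border for L = 0.634.
h = uniform_scale(0.683)
print(round(h, 3), round(1.0 - h, 3))
print(significant_about_median(0.98, 0.05), significant_about_median(0.90, 0.05))
```

When the target is not the median, the same scale applies but the target no longer sits at $H = 0$, which is the asymmetric case described above.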

While it is my opinion that as yet no unitary definition or set of alternative
definitions can be imposed on the profession, I would urge consideration
of my fourth definition in §2 (d),¹⁴ so that the significance assigned to $H$
relative to the target $H_0$ be the same whether the interest is only in $H > H_0$
or $H < H_0$ or in all values of $H$; and I should think that such must be the
case if whether $H$ be greater or less than $H_0$ be the result of a convention
applicable only after the calculations have been made, as if it should be
agreed that the difference of two means should be so taken after their
determination as to be positive.

Some of these considerations were applied to that type of variance
analysis in which the problem is the acceptance or rejection at a given
significance level of the hypothesis that the samples came from universes
with equal variances. A new approximate test therefor was suggested.


* This is the third note on points in elementary statistics which seem not beyond the
stage of discussion; previous ones in these PROCEEDINGS, 37, 696-704 (1951) and 38,
899-905 (1952).

¹ It seems to be customary to use the term distribution in two senses: (1) for the
frequency distribution $f(z)$, and (2) for the integral distribution function $I(z)$ thereto
related by $dI = f\,dz$.

² More generally, the limits may be a; one may still write the limits as $\pm\infty$ if one
will adopt the convention that $f(z) = 0$ outside its meaningful range.

³ One should not confuse the target value with the bull's-eye.

⁴ The ratio $f(z)/f(-z)$, $z > 0$, runs monotonically from 1 when $z$ is 0 to 0 or $\infty$ when $z$
is infinite according as $n_1 < n_2$ or $n_1 > n_2$.

⁵ The tabulated values of $z$ are for the 0.05 point of the (integral) distribution function
on the hypothesis that the larger variance is in the numerator and thus those values
are all positive; an interchange of numerator and denominator changes the sign of $z$.
Thus $z = -2.6479$ for $n_1 = 1$ and $n_2 = 2$ is found in the tables as $z = 2.6479$ for $n_1 = 2$
and $n_2 = 1$.

⁶ It is instructive to deal with this in some detail. The usual form of the frequency
function for the t-test is

$$\frac{\Gamma\left(\frac{n+1}{2}\right)}{\sqrt{n\pi}\,\Gamma\left(\frac{n}{2}\right)}\left(1 + \frac{t^2}{n}\right)^{-(n+1)/2},\quad -\infty < t < \infty, \qquad \text{or twice this for } 0 \le t < \infty.$$

In either case the target value is 0; in the former at the center of the range, in the latter
at the extreme left. In this latter case one could introduce $e^z = t$ and write

$$\frac{2\,\Gamma\left(\frac{n+1}{2}\right)}{\sqrt{n\pi}\,\Gamma\left(\frac{n}{2}\right)}\cdot\frac{e^z}{\left(1 + e^{2z}/n\right)^{(n+1)/2}},$$

which is the form of the $z$ distribution for $n_1 = 1$ with the proper factor of $n = n_2$; but
the target value remains at the extreme left, $z = -\infty$. This z-distribution is itself
symmetrical around $z = 0$ when and only when $n = 1$, in which case it becomes
$(2/\pi)e^z/(1 + e^{2z})$. For the t-test it must be left in this form with $z = -\infty$ as target value;
but for the variance test with its target value at $z = 0$, it may be doubled and restricted
to $z > 0$. Thus for the variance test with $n_1 = n_2 = 1$, one has

$$\frac{4}{\pi}\cdot\frac{e^z}{1 + e^{2z}}, \quad \text{and} \quad \frac{4}{\pi}\left(\frac{\pi}{2} - \tan^{-1} e^z\right) = L.$$

If $L = 0.05$, the result is $z = 3.237$; if $L = 0.10$, it is 2.542 as entered in the tables for
the 0.05 point.

⁷ I am speaking only of that type of variance analysis with which I have been most
concerned in various biological, medical, and social applications with which I have been
busied, viz., where the two samples are on an equal footing and the problem requires the
acceptance or rejection at the significance level $L$ of the "null hypothesis" that in the
universes from which the samples came the variances are equal. The derivation of the
frequency distribution is found in M. G. Kendall's Advanced Theory of Statistics, Vol. I,
p. 249 (unfortunately with some typographical errors in the original edition, but happily
not serious ones). For the variance analysis of which I am talking the target value of
$\chi^2/n_1$ is not 0 but the median of $\chi^2/n_1$ in a universe whose variance is $\sigma_2^2$.

⁸ The extreme difference from $n_1 = n_2$ is when $n_1 = 1$, $n_2 = \infty$. For this the
frequency function is $(2/\pi)^{1/2}e^{-u^2/2}$ with $u^2 = F = \chi^2$, if the variance with one degree of
freedom be the numerator of $F$. The median is the "probable error" $u = 0.6745$ to
which correspond targets $z = -0.394$ and $F = 4/9$. The integral distribution function
has the value 0.683 when $z = 0$, so that better than 2/3 of the values of $s_1^2$ are less than
$\sigma_2^2$, and an observed value $s_1^2 = \sigma_2^2$ would indicate that the hypothesis $\sigma_1 = \sigma_2$ is on the
border of rejection for $L = 0.634$; the same value of $L$ would be indicated for $F = 0.167$
and $z = -0.894$ as for $F = 1$ and $z = 0$.

⁹ To do adequate justice to Fisher in dealing with this matter I must call attention to
the initial words of the quotation "These tests," for it is indeed possible that the tests
to which he is there specifically referring are like the ordinary $\chi^2$ test in that for them the
target value is 0 and all the value of $L$ should go into one tail of the distribution. And
he may intend that when the standard deviation is used, it shall be considered to apply
only on one side of $z = 0$, and to the amount $1.645\sigma$ instead of $1.96\sigma$. Furthermore it
may well be that the two problems, one asymmetric, of determining whether one
estimate of variance is significantly greater than another, and the other symmetric, of
examining a difference in terms of its standard deviation, are in fact different, though I
find no explicit indication in these paragraphs that they are. I shall not, however, at
this time go into this question nor into the more complicated matter of first and second
kinds of error (a discussion of which can be found in Chap. VIII of Techniques of
Statistical Analysis by C. Eisenhart, M. W. Hastay, and W. A. Wallis, McGraw-Hill, 1947).
I am not trying to prove anybody wrong or right; there seems still to be too little
agreement and too little clarity for that, and too much possibility that there is involved a
variety of really different problems.

¹⁰ Wilson, E. B., and Hilferty, M. M., these PROCEEDINGS, 17, 684-688 (1931). M. G.
Kendall did us the honor to give this approximation in his Advanced Theory of Statistics,
Vol. I, §12.8, and to show how good it was at $P = 0.01$, 0.05, 0.95, 0.99 for $n = 20$, 30, 40,
50. We had no interest in its excellence except for small values of $n$. For $n = 1$ it is
not good at $P = 0.8$, and although the least value $\chi^2$ can have is 0, at which $P$ should be
1.00, the approximation for $n = 1$ makes $\chi^2$ vanish for the normal deviate $-1.65$, for
which $P = 0.95$ instead of 1.00.

¹¹ In terms of $F$ or $z$ we may write

$$\frac{s_1^{2/3} - s_2^{2/3}}{\left(\dfrac{n_1 s_1^2 + n_2 s_2^2}{n_1 + n_2}\right)^{1/3}} = \frac{2\sinh(z/3)}{\left(\cosh z + \dfrac{n_1 - n_2}{n_1 + n_2}\sinh z\right)^{1/3}},$$

the computation of which will be easier in the one or the other form according to what
computational aids are available. The values tabulated were obtained with the use of
Huntington's Four Place Tables and the final digits are not trustworthy.

¹² A table for $P = 0.05$, in our sense is given by W. J. Dixon and F. J. Massey, Jr., in
Introduction to Statistical Analysis, McGraw-Hill, New York, 1951, p. 310, with credit
to E. S. Pearson, Biometrika, 33 (1943).
¹³ The case $n_1 = 2$, $n_2 = n$ will not be symmetric in $z$ unless $n = 2$, but it is immediately
integrable in a very simple way:

$$1 - I = \frac{1}{\left(1 + \dfrac{2F}{n}\right)^{n/2}}, \qquad F = \frac{n}{2}\left[\left(\frac{2}{1 - H}\right)^{2/n} - 1\right], \qquad \text{median } F = \frac{n}{2}\left(2^{2/n} - 1\right).$$

The condition for significance at the level $L$ is simply $H = 1 - L$. Clearly $H$ will be
positive or negative according as $F$ is above or below its median.
¹⁴ Assuming, of course, that we are dealing with variables essentially ordinal; that
the cases in §2(a) and §2(c) have cardinal applications is apparent.


GENERALIZED RIEMANN SPACES AND GENERAL RELATIVITY
By LUTHER P. EISENHART
FINE HALL, PRINCETON UNIVERSITY 
Communicated April 13, 1953 


A space of coordinates $x^i$ ($i = 1, \ldots, n$) with which is associated a non-symmetric
tensor $g_{ij}$ whose determinant $g$ is not zero, is termed a generalized
Riemann space. By definition

$$\Lambda_{ijk} = \frac{1}{2}\left(g_{ik,j} + g_{kj,i} - g_{ij,k}\right), \qquad (1)$$

where throughout this paper a quantity followed by a comma and index
indicates the derivative of the quantity with respect to the coordinate
with that index; thus $g_{ij,k} = \partial g_{ij}/\partial x^k$.

Throughout this paper $g_{\underline{ij}}$ denotes the symmetric part of $g_{ij}$. It is
assumed that the determinant of $g_{\underline{ij}}$ is not zero. Quantities $g^{\underline{ij}}$ are uniquely
defined by

$$g_{\underline{ij}}\, g^{\underline{ik}} = \delta_j^k,$$

where the summation convention of a repeated index, as $i$ in this case, is
applied here and throughout the paper, and $\delta_j^k$ are Kronecker deltas. By
definition

$$\Lambda^h_{ij} = g^{\underline{hk}}\, \Lambda_{ijk}. \qquad (2)$$
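The splitting of a non-symmetric tensor into its symmetric and skew-symmetric parts, and the inverse of the symmetric part, can be illustrated numerically. This is a minimal sketch for a constant two-dimensional example; the component values and all function names are hypothetical and are not taken from the paper.

```python
def symmetric_part(g):
    """Return the symmetric part (g_ij + g_ji) / 2."""
    n = len(g)
    return [[0.5 * (g[i][j] + g[j][i]) for j in range(n)] for i in range(n)]


def skew_part(g):
    """Return the skew-symmetric part (g_ij - g_ji) / 2."""
    n = len(g)
    return [[0.5 * (g[i][j] - g[j][i]) for j in range(n)] for i in range(n)]


def inverse_2x2(m):
    """Invert a 2 x 2 matrix with non-vanishing determinant."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    if det == 0:
        raise ValueError("determinant must not be zero")
    return [[m[1][1] / det, -m[0][1] / det], [-m[1][0] / det, m[0][0] / det]]


def contract_first_index(a, b):
    """Return d[j][k] = sum over i of a[i][j] * b[i][k]."""
    n = len(a)
    return [[sum(a[i][j] * b[i][k] for i in range(n)) for k in range(n)] for j in range(n)]


g = [[2.0, 1.0], [-3.0, 4.0]]       # hypothetical non-symmetric components
g_sym = symmetric_part(g)           # [[2, -1], [-1, 4]]
g_skew = skew_part(g)               # [[0, 2], [-2, 0]]
g_inv = inverse_2x2(g_sym)          # contravariant components of the symmetric part
delta = contract_first_index(g_sym, g_inv)
print(g_sym, g_skew, delta)
```

The contraction over the repeated index reproduces the Kronecker deltas, which is the defining property of the inverse quantities.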


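It can be shown¹ that these quantities in two coordinate systems $x^i$ and
$x'^{\alpha}$ are in the relation

$$\frac{\partial^2 x^h}{\partial x'^{\alpha}\,\partial x'^{\beta}} + \Lambda^h_{ij}\,\frac{\partial x^i}{\partial x'^{\alpha}}\frac{\partial x^j}{\partial x'^{\beta}} = \Lambda'^{\gamma}_{\alpha\beta}\,\frac{\partial x^h}{\partial x'^{\gamma}}. \qquad (3)$$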





VOL. 39, 1953 PHYSICS: L. P. EISENHART


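From (1) one has

$$\Lambda_{\underline{ij}k} = \frac{1}{2}\left(\Lambda_{ijk} + \Lambda_{jik}\right) = \frac{1}{2}\left(g_{\underline{ik},j} + g_{\underline{kj},i} - g_{\underline{ij},k}\right),$$

$$\Lambda_{\underset{\vee}{ij}k} = \frac{1}{2}\left(\Lambda_{ijk} - \Lambda_{jik}\right) = \frac{1}{2}\left(g_{\underset{\vee}{ji},k} + g_{\underset{\vee}{kj},i} + g_{\underset{\vee}{ik},j}\right), \qquad (4)$$

where $g_{\underset{\vee}{ij}}$ denotes the skew-symmetric part of $g_{ij}$. Denoting by $\Lambda^h_{\underline{ij}}$ and
$\Lambda^h_{\underset{\vee}{ij}}$ the symmetric and skew-symmetric parts respectively of $\Lambda^h_{ij}$, we have

$$\Lambda^h_{\underline{ij}} = \left\{{h \atop ij}\right\}, \qquad (5)$$

where $\left\{{h \atop ij}\right\}$ is the Christoffel symbol of the second kind in the quantities
$g_{\underline{ij}}$, and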

h h h > 
Su Mis - Lnidye = £nj Abi _ Aijze (6) 

From (2), (4) and (6), one has 
A, = Aj. = gr Ain - g Ass ae 0. (7) 


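If in equation (3) the indices $\alpha$ and $\beta$ are interchanged and also the
dummy indices $i$ and $j$, one obtains

$$\frac{\partial^2 x^h}{\partial x'^{\alpha}\,\partial x'^{\beta}} + \Lambda^h_{ji}\,\frac{\partial x^i}{\partial x'^{\alpha}}\frac{\partial x^j}{\partial x'^{\beta}} = \Lambda'^{\gamma}_{\beta\alpha}\,\frac{\partial x^h}{\partial x'^{\gamma}}. \qquad (8)$$

When this equation is subtracted from equation (3) it is found that $\Lambda^h_{\underset{\vee}{ij}}$ is
a tensor.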
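Another set of coefficients of affine connection $\Gamma^h_{ij}$ are related in two
coordinate systems $x^i$ and $x'^{\alpha}$ by equations of the form (3), that is

$$\frac{\partial^2 x^h}{\partial x'^{\alpha}\,\partial x'^{\beta}} + \Gamma^h_{ij}\,\frac{\partial x^i}{\partial x'^{\alpha}}\frac{\partial x^j}{\partial x'^{\beta}} = \Gamma'^{\gamma}_{\alpha\beta}\,\frac{\partial x^h}{\partial x'^{\gamma}}. \qquad (9)$$

Subtracting equation (3) from this equation, one has

$$\left(\Gamma^h_{ij} - \Lambda^h_{ij}\right)\frac{\partial x^i}{\partial x'^{\alpha}}\frac{\partial x^j}{\partial x'^{\beta}} = \left(\Gamma'^{\gamma}_{\alpha\beta} - \Lambda'^{\gamma}_{\alpha\beta}\right)\frac{\partial x^h}{\partial x'^{\gamma}},$$

from which it follows that, if we put

$$\Gamma^h_{ij} = \Lambda^h_{ij} + a^h_{ij}, \qquad (10)$$

the quantities $a^h_{ij}$ are components of a tensor.

It is now required that the tensor $a^h_{ij}$ be such that

$$g_{ij,k} = g_{hj}\,\Gamma^h_{ik} + g_{ih}\,\Gamma^h_{kj}. \qquad (11)$$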







From this equation it follows that 
h h h h h h h h 
ruk ™ Lnj Oyj) oe 2 in Xie - Lnjdir ‘ LZ inljp = Lnj( Qin 5 og AiR 2 2 Lnil Ajp = AjR)s 
h h . : : 
where aj, and a;, are the symmetric and skew-symmetric parts respectively 
ee ae é h ; “—_r 
of aj, similarly for a aj,, and in what follows. In consequence of (5) the 
left-hand member of this equation is identically zero. Consequently the 
} — i . ‘ 
tensor aj; must satisfy conditions of the form 
h h h i h h h 
LnjgQir t Enid jy + gnj(Ajpe + Air) + Lnil Ajp 4 Azz) 
Permuting 7, 7, k cyclically twice, one obtains respectively 
h h h h h h 
Ends + Bnei + Sne(Ayy + Gj) + ny( Ape + dei) 


h 


h h h 
Gpj) + Ln ( Ajj + a,j) 


) 


h h i 
Lnilik + neig + BnilA 
One-half the sum of these three equations is 
Bnidjn + nithi + Saat; = 0. (12) 
By means of this result the first of the above equations may be replaced by 
h h h h h ‘ 
LniQyj _ £nj) (Ai r jx) + Lnil Aix = jk). (13) 
When the expressions of the form (11) with the values (10) of the I's 
are substituted in equation (1), the result is reducible to 
h i h h 
@yj(Aas + On) + galdu + On) + Gate = 0. (14) 
Interchanging 7 and 7 one has 
h h h h h 
Lin Min + Gin) + nil Ang + Gey) F Sudje = 0. 
One-half the sum of these equations is equation (13). One-half their 
difference is 
h h h h h - 
Lny(Age + Api) + SnilAje + jy) + Suai; = 0. (15) 
When to this equation is added the two equations when 7, j, k are permuted 
cyclically twice, the result is 


Bni(2Aj, + Bah.) + gn(2Ati + Bah,) + gn(2at, + 3a%,) = 0. (16) 


This equation is satisfied by equations of the form 


yA h _ 
3 Aj, (17) 
as is also equation (15) in consequence of (6). For these values equation 


(13) is equal to 







(gn) Mie + gnidfe). 


| 
3 


h 
Lrrdiy aid 
From (10) and (17) it follows that 
vh h h 
| it — Ai; — aij 


so that in consequence of (7) 
r=Ti, = 0. (20) 
When equation (11) is multiplied by g‘’g”’ and 7 and j are summed, the 
results after p and qg are replaced by 7 and j respectively is 
B42 et Tie + e°T en = 0. 


When this equation is multiplied by g,, and 7 and 7 are summed, the result 


is 
(log w), = . (21) 
where w is the square root of minus the determinant of g;;.._ If we put 
leah it 
we have from the above equation 
ae + OTe + a" Vin — OTe = 0. 
Denoting by 9% the skew-symmetric part of q’, and by g” the symmetric 


part, not to be confused with q” previously defined, one has 


yh 


’ hit’ A} hiy 
OY et OMT ie — O' Vi, + OT ie — OMT he — OT ie = 0. 
In consequence of (20) it follows that 
ay. as (). (233) 


When the condition of integrability is applied to equation (9) it is 
found’ that the quantities I” .., defined by 


wh wh wh li ph Jp 
Dijk ond ry a Die J 2 WF iv 5 Piel i, 
are components of a tensor. Contracting for 7 and k we have in conse- 
quence of (20) 
, vh uw ph vh 
Ty = Tin — Val; + TisFua — Tin, 5 (24) 


The symmetric and skew-symmetric parts of this tensor are 


: sh e a J ph 1 wh wh 
Py; = Visa ~ Tals + Tals + Tila — 5 Via,5 + Ui, i) 







. : sh yl yh ul ph a yh l sh sh 
r,; 2 + Dial j Pa Pala Clik — 5 (Tis, con I"m, i). 


. 


From (10) and (19) one has 


h r I 


" he h h 
Dy = Ay tr ayy y= 3 Aji, 
J a 


which enables one to express [';, in terms of gi, and gy. 
The Ricci tensor Rj; in terms of g,; is given by* 


a t l a U 
R | ae Ai h - Ain dit = Ai At as (Ain, + jn, i)» 


tj 9) 
since 
} 
Aj, = (log «),; 


where uw is the square-root of minus the determinant of g;). 
By means of (25) and (26) it is found that 


’ h l h a l l h l h h 
I 1) R,; T aij: ' oe A pl; + AQ! 1 9 Aindj oa 5 (A jh. j “|- jh: i)s 


(27) 
where a semi-colon followed by index indicates a covariant derivative 
based upon the g,). 


In like manner 


' l ah hl bt os st lk h 

I ij 3 (Ajj. h + AjQin + Aj iQ jh ” Ap) mn (Cin; a (jh; i}: (28) 
In consequence of (21) the tensor [,,, defined by (24), is the Einstein 
= (), and 


’ 


tensor R;;.5 His results are (22), (23), I, 


Ri = 0, Ry.» 4 Ry. 4 of Ry, j = () (29) 


He interprets (23) as the expression for the vanishing of the magnetic 
current, and from the tensor A;,;, he derives the electric current density. 
From (27) it follows that, if, = 0 the first of equations (29) is 


Ry = Ay 


where R;, is the Ricci tensor, not to be confused with the present Einstein 
tensor R,,, where A, is given by (27). These quantities must be such 


that the identities® 
tk l 
g | Ru. — 5 Ru; s) = 0 


be satisfied. 





VoL. 39, 1953 PHYSICS: L. PAULING 


¹ These PROCEEDINGS, 37, 311 (1951).

² R. G., p. 17. This refers to the author's Riemannian Geometry, Princeton University
Press.

³ Eisenhart, L. P., "Non-Riemannian Geometry," Amer. Math. Soc. Colloquium
Publications, 8, p. 5.
‘RK. 6... ps 2) 
5 Einstein, A., The Meaning of Relativity, Fourth Edition, Princeton University 
Press, pp. 144, 146, 147. 
⁶ R. G., p. 82.


A THEORY OF FERROMAGNETISM 
By Linus PAULING 


GATES AND CRELLIN LABORATORIES OF CHEMISTRY,* CALIFORNIA INSTITUTE OF 
TECHNOLOGY 


Communicated April 1, 1953


The properties of ferromagnetic substances are in reasonably good
accord with the theory of Weiss.¹ In this theory it is postulated that
the atomic magnets tend to be brought into parallel orientation not only 
by an applied magnetic field but also by an inner field which is propor- 
tional to the magnetization of the substance. The inner field is not due 
to magnetic interaction of the magnetic moments of the molecules, but to 
electrostatic interactions, which are related to the orientation of the 
magnetic moments of electrons through the Pauli principle. During the 
past twenty-five years many efforts have been made to develop a precise 
theory of the interactions that produce the inner field, and to account in 
this way for the observed magnetic properties of ferromagnetic substances, 
but these attempts have not been successful—no one has published a theory 
of the electronic structure of ferromagnetic substances that permits reason- 
ably good predictions to be made of the values of the saturation magnetic 
moment and the Curie temperature. 


I have now formulated a theory of ferromagnetism that seems to provide
a simple explanation of the phenomenon. In the following paragraphs
it is shown that the straightforward application of the theory to iron leads,
with use only of spectroscopic data for the iron atom, to the predicted values
2.20 Bohr magnetons for the saturation magnetic moment per iron atom
and 1350° K for the Curie temperature; these values are in reasonable
agreement with the experimental values, 2.22 magnetons and 1043° K,
respectively.

The theory can also be applied to cobalt and nickel, and to alloys. In
this application, however, a complicating factor must be considered:
the contribution of two different values for the atomic magnetic moment.
Satisfactory agreement between theory and experiment is obtained when
the ratio of contributions of the two atomic moments has the value assigned
to it in a discussion of the electronic structure of these metals that was
published fifteen years ago.²

The Nature of the Theory.—In discussion of the transition metals the 
electrons may be divided into two classes, the conduction electrons and 
the atomic electrons. The conduction electrons are the electrons for 
which the exchange integrals between adjacent atoms are large, and the 
atomic electrons are the electrons for which the exchange integrals are 
small. Some investigators have thought that the electrons occupying 
4s and 4p orbitals comprise the conduction electrons, and those occupying 
the 3d orbitals (plus those in inner shells) comprise the atomic electrons. 
The first quantum mechanical theory of ferromagnetism is that of Heisen- 
berg. It was shown through the work of Hund, Heitler and London, and 
others that the exchange integral between electrons with antiparallel spin 
on the same atom usually has positive sign, whereas that between electrons 
with antiparallel spin on different atoms has negative sign. In conse- 
quence, atomic electrons align their spins parallel, to the extent permitted 
by Pauli’s principle. Moreover, it would be expected that atoms that 
have a magnetic moment, because of the presence of one or more unpaired 
electrons on each atom, would tend to align their moments antiparallel. 
The atomic moments of atoms in a crystal would then tend to divide into 
two classes, such that the moments of the neighbors of each atom are 
oriented antiparallel to the moment of that atom. This would give rise 
to antiferromagnetism (when the moments of the two kinds of atoms are 
equal) or to ferrimagnetism (when the moments are unequal). Heisen- 
berg assumed, in order to account for the phenomenon of ferromagnetism, 
that under certain circumstances the exchange integrals of electrons with 
antiparallel spins on adjacent atoms are positive; the atomic moments 
would then tend to line up parallel to one another. Heisenberg’s theory 
is unsatisfactory in that there is no independent evidence in support of 
the assumption that exchange integrals between atoms are ever positive; 
this remains an ad hoc assumption. Moreover, the theory does not provide 
a reasonable explanation of the observed values of the saturation magnetic 
moment. 

A somewhat different suggestion about the origin of ferromagnetism 
was made by Bloch.⁴ The conduction electrons usually occupy the low
levels of a conduction band of orbitals in pairs, with opposed spins. Pauli⁵
pointed out that at temperature equilibrium a few of the pairs, occupying 
the uppermost levels, would be uncoupled by thermal agitation, one of the 
electrons being raised to a slightly higher level. These uncoupled electrons 
could then orient themselves independently in a magnetic field; Pauli in 







this way explained the temperature-independent paramagnetism observed 
for the alkali metals and some other metals. Bloch observed that under 
some circumstances (proper ratio of exchange integrals to the density of 
states in energy) the exchange energy of the conduction electrons could 
accomplish some uncoupling, producing a set of conduction electrons 
occupying separate orbitals, and with parallel spins. The theory has been
discussed further by Slater,⁶ who has suggested that a rather weak
interaction between 3d orbitals on adjacent atoms would give rise to a narrow
band of levels, with high density of states in energy, and that the conditions
for ferromagnetism might be satisfied in this way. In order to account for
the observed moment 2.22 magnetons per atom for iron, the assumption
is made that 0.22 electron per atom is transferred from the 3d orbitals to
the 4s orbital, constituting the main conduction band; the atomic moment
is then ascribed to the hole of 2.22 unoccupied orbitals of the 3d subshell,
which contains 7.78 electrons.

Recently it was pointed out by Zener⁷ that the atomic moments, in
parallel orientation, might react with the electrons in the conduction band 
in such a way as to uncouple some of the pairs, producing a set of conduc- 
tion electrons occupying individual orbitals, and with spins parallel to the 
spins of the atomic electrons. Zener assumed that the conduction band 
for the transition metals is formed by the 4s orbitals of the atoms, and that 
there is somewhat less than one conduction electron per atom in iron, 
cobalt, and nickel. Like Slater, he attributed the atomic magnetic 
moments to the partially filled 3d subshell. 

The new theory is based upon the assumption that there are about six 
conduction electrons per atom in the transition metals. In the first
discussion of the new metallic valences² the value 5.78 was assigned as the
normal metallic valence of the atoms chromium, manganese, iron, cobalt,
and nickel. This value was obtained by subtracting from the total
number of outer electrons of iron (eight electrons, outside of the argon
shell) the number 2.22 of unpaired electrons, as indicated by the value
2.22 for the saturation magnetic moment of iron. It now seems probable
that the valence is equal exactly or nearly exactly to six. In the
resonating-valence-bond theory of metals⁸ it is assumed that the 3d, 4s, and 4p
orbitals of an atom hybridize to form bond orbitals, which are combined
with bond orbitals of other atoms in the crystal, with use of Bloch
functions, to produce orbitals that are occupied by electron pairs. These
orbitals are closely similar to those used for occupancy by single electrons
or by two electrons with opposed spins in the usual electron theory of
metals. It might be expected that as many atomic orbitals could be
combined in this way into orbitals for conduction electrons (bonding
electrons) as satisfy the criterion of large overlapping, characteristic of
good bond orbitals. The best bond orbital that can be formed from







3d, 4s, and 4p orbitals is a $3d^5 4s 4p^3$ orbital, for which the bond strength
has value 3.000. Two equivalent orbitals orthogonal to the first one can
be constructed. It has been found that a fourth, a fifth, and a sixth of
nearly equal strength can also be made, but that a seventh orbital,
orthogonal to the other six, is a much poorer bond orbital. Hence it seems not
unreasonable to assume that only six bonds will be formed by an atom in 
a metal, and that the remaining orbitals may be occupied by electrons which 
interact only weakly with electrons on other atoms. 

In iron, cobalt, and nickel there are some unpaired atomic electrons, 
giving rise to an atomic magnetic moment. There are also six electrons 
per atom which occupy a conduction band—these are the valence electrons. 
It is assumed that the direct interaction of the atomic electrons on neighbor- 
ing atoms is small, and that the exchange integral is, as is customary, 
negative. If this interaction is larger than the interaction with the 
conduction electrons, which will be discussed immediately, the moments 
on adjacent atoms will tend to align themselves in the antiparallel orienta- 
tion, and the substance will be antiferromagnetic. If the direct inter- 
action energy of the moments is smaller than the possible interaction 


energy with the conduction electrons, however, the substance will be 


ferromagnetic. 

The conduction electrons occupy metallic orbitals which are formed by 
linear combination of atomic bond orbitals, which are dsp hybrids. A 
metallic electron, which may be considered to move from atom to atom 
through the crystal, interacts with the electrons producing the atomic 
moment of each atom to the extent that a bond orbital of the atom con- 
tributes to the metallic orbital. The exchange integral for the interaction 
is positive in sign, according to Hund’s principle, and the interaction 
results in stabilization if the spin of the metallic electron is parallel to the 
atomic moment, and in destabilization if the spin is antiparallel. There 
is, then, the possibility of stabilization of the metal crystal if the atomic
moments are aligned parallel to one another, and a significant number of 
metallic electrons, occupying separate metallic orbitals, align themselves 
with spins parallel to the atomic moments. If the energy of interaction
of a metallic electron with an atomic moment is −ε for parallel spin and
+ε for antiparallel spin, the width of the conduction band, at the top of
the doubly occupied conduction levels, that can be occupied by single
electrons, with parallel spin, will be equal to 2ε. The energy of uncoupling
the first pair of electrons (occupying the topmost level) is zero, so that a
gain in stabilization through the change in energy −2ε will be achieved
by uncoupling this pair of electrons. The gain in energy for the last pair
is, however, zero, and the average change in energy is −ε per electron pair,
or −ε/2 per electron. If n conduction electrons are thus uncoupled, the
ferromagnetic crystal will be stabilized through this interaction by the
energy change −nε/2. The energy required to reverse the direction of
an atomic moment is equal to 2nε; accordingly the energy of interaction
of the atomic moments and the uncoupled conduction electrons acts in
the way corresponding to a Weiss field.
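
The averaging argument of the preceding paragraph can be checked numerically. The following is a minimal sketch, assuming (a simplification that the text does not state) that the levels are evenly spaced across the band of width 2ε; the function name and the sample value of ε are illustrative only.

```python
def mean_energy_change_per_electron(eps, levels=100_000):
    """Average energy change per uncoupled electron for a band of width 2*eps.

    Assumption: levels are evenly spaced in energy across the band.
    A pair lying at depth d below the top of the band costs d to uncouple
    and gains 2*eps of interaction energy, so its net change is d - 2*eps.
    """
    total = 0.0
    for i in range(levels):
        depth = 2.0 * eps * (i + 0.5) / levels   # midpoint of the i-th slice
        total += depth - 2.0 * eps               # net change for this pair
    per_pair = total / levels                    # tends to -eps
    return per_pair / 2.0                        # two electrons per pair


# Illustrative value of eps, in ev
print(mean_energy_change_per_electron(0.67))
```

Under this assumption the result is −ε/2 per electron, so that n uncoupled electrons contribute −nε/2.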

Quantitative calculations can be made on the basis of the assumption 
that the density of levels in energy for the conduction band is given by 
the simple expression for the free electron in a box, and the interaction 
energy ε of a dsp hybrid conduction electron and the atomic moment can
be calculated from the spectroscopic values of the energy of interaction of 
electrons in the isolated atom. The results of this calculation for iron are 
discussed in the following section. 

Application of the Theory to Iron.—We assume, in essential accordance
with earlier conclusions,²,⁸ that six of the eight outer electrons of the iron
atom are valence electrons, occupying d³sp² orbitals, and the remaining
two are atomic electrons.

The electron configuration of the atom in the metal is accordingly
assumed to be 3d⁵4s4p². This assumption differs considerably from the
usual assumption that the electron configuration is close to 3d⁸, for
example Slater's assumption that the configuration is 3d7-4s°"*. It is
interesting to note, however, that the normal state¹⁰ of the isolated iron
atom is not based on the configuration 3d⁸ but on the configuration 3d⁶4s²;
its Russell-Saunders symbol is ⁵D. The first excited state is 3d⁷4s ⁵F,
and the third configuration to appear, with increasing energy, is 3d⁶4s4p.
There are, in fact, twenty-eight Russell-Saunders states that appear in
the Fe I spectrum below the first state, 3d⁸ ³F, that is assigned to the
configuration 3d⁸. The spectroscopic data indicate that the 4s atomic
orbital is more stable than the 3d orbital by about 1.2 ev, and that the
energy of a 4p electron is only about 1.4 ev greater than that of a 3d
electron. The configuration 3d⁵4s4p² accordingly lies about 4 ev above
the normal configuration, 3d⁶4s², and only 1.6 ev above the configuration
3d⁸. The bond energy of six valence electrons can, of course, easily effect
the promotion to the assumed configuration 3d⁵4s4p².
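
The promotion energies quoted above follow from simple bookkeeping of orbital energies. The sketch below assumes a purely additive one-electron picture, with the 3d orbital as the zero of energy and the 4s and 4p offsets taken from the text; the dictionary and function names are illustrative.

```python
# Orbital energies in ev relative to 3d, from the values quoted in the text:
# 4s is about 1.2 ev more stable than 3d; 4p lies about 1.4 ev above 3d.
ORBITAL_EV = {"3d": 0.0, "4s": -1.2, "4p": 1.4}


def configuration_energy(occupancy):
    """Sum of one-electron energies (additive approximation, an assumption)."""
    return sum(ORBITAL_EV[orbital] * count for orbital, count in occupancy.items())


normal = {"3d": 6, "4s": 2, "4p": 0}    # 3d6 4s2
d_eight = {"3d": 8, "4s": 0, "4p": 0}   # 3d8
assumed = {"3d": 5, "4s": 1, "4p": 2}   # 3d5 4s 4p2

above_normal = configuration_energy(assumed) - configuration_energy(normal)
above_d_eight = configuration_energy(assumed) - configuration_energy(d_eight)
print(round(above_normal, 2), round(above_d_eight, 2))
```

With these inputs the assumed configuration lies about 4 ev above 3d⁶4s² and about 1.6 ev above 3d⁸, as stated.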

The exchange energy −ε(4s, 3d) of a 4s electron and a 3d electron with
parallel spin can be calculated from the separation of the states 3d⁷4s ⁵F
and 3d⁷4s ³F. The separation of these states (averaged over the levels
with different values of total moment J) is 4947 cm⁻¹. In the ⁵F state
the 4s electron has spin parallel to the spins of three 3d electrons, and in the
³F state it has spin antiparallel to the spins of these three 3d electrons.
The value of the exchange energy per electron is accordingly −4947/6 =
−825 cm⁻¹. For −ε(3d, 3d) one sixth of the difference in energy of
3d⁶4s² ⁵D and 3d⁶4s² ³D was used; its value is −4826 cm⁻¹. The evalua-
tion of −ε(4p, 3d) as −470 cm⁻¹ is similar, although somewhat less
straightforward, because it involves averaging over a number of Russell-
Saunders states. Assuming that the valence electrons at the top of
the band have the average hybrid character 3d³4s4p², the interaction
energy of one of these valence electrons and an atomic electron, assumed
to be approximately a 3d electron, is found to be −2707 cm⁻¹, or −0.334
ev, with probable error about 10%.
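
The hybrid average can be reproduced as a weighted mean of the three exchange energies, with weights 3 : 1 : 2 for the 3d, 4s, and 4p character. This is a sketch under that reading of the text; the conversion factor of about 8065.5 cm⁻¹ per ev is an assumption introduced here and is not taken from the paper.

```python
# Magnitudes of the exchange energies with a 3d atomic electron, in cm^-1
EXCHANGE_CM = {"3d": 4826.0, "4s": 825.0, "4p": 470.0}
# Average hybrid character 3d3 4s 4p2 of a valence electron at the top of the band
HYBRID_WEIGHTS = {"3d": 3, "4s": 1, "4p": 2}
CM_PER_EV = 8065.5  # assumed conversion factor


def hybrid_exchange_cm():
    """Weighted mean exchange energy (magnitude) for the hybrid orbital."""
    total_weight = sum(HYBRID_WEIGHTS.values())
    weighted = sum(EXCHANGE_CM[k] * w for k, w in HYBRID_WEIGHTS.items())
    return weighted / total_weight


energy_cm = hybrid_exchange_cm()
energy_ev = energy_cm / CM_PER_EV
print(round(energy_cm), round(energy_ev, 3))
```

The weighted mean comes to about 2707 cm⁻¹, or roughly a third of an ev, close to the figure quoted in the text.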

The interaction energy of the valence electron with the two atomic 3d 
electrons, with parallel spins, is accordingly —0.67 ev, and the width of 
the energy band that would be occupied by uncoupled valence electrons 
is 1.34 ev. The number of orbitals in this band can be calculated from the 
equation for the distribution of energy levels for an electron in a box. The 
number of levels per atom is 


$$n = \frac{4\pi V (2mE)^{3/2}}{3h^{3}},$$


where V is the atomic volume, m is the mass of the electron, E is the
kinetic energy of the electron, and h is Planck’s constant. For iron, with
V = 11.72 × 10⁻²⁴ cm³, and six electrons per atom, which can occupy
three orbitals, in pairs, the value of E, the energy of the top level occupied
by conduction electrons if they are all paired, is calculated to be 23.3 ev.
Differentiation of this equation leads to the equation 


$$dn = \frac{2\pi V (2m)^{3/2} E^{1/2}\, dE}{h^{3}}$$


as the number of levels dn with the width in energy dE. Substitution of
the value 1.34 ev for dE leads to the value 0.26 for dn. Accordingly we
predict that 0.26 of the conduction electrons will be uncoupled by inter-
action with the atomic moments, and will align themselves parallel to
these moments.
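
The two numbers in this calculation, 23.3 ev and 0.26, can be recovered from the free-electron level count. The sketch below inverts the expression for the number of levels per atom to obtain the top-level energy and then uses dn = (3/2) n dE/E, which follows on differentiation. The physical constants are modern SI values supplied here as assumptions; the function names are illustrative.

```python
import math

H = 6.62607e-34      # Planck's constant, J s (assumed modern value)
M_E = 9.10938e-31    # electron mass, kg (assumed modern value)
EV = 1.602177e-19    # joules per ev (assumed modern value)


def top_level_energy_ev(orbitals_per_atom, atomic_volume_m3):
    """Invert n = 4*pi*V*(2mE)^(3/2) / (3h^3) for E, returned in ev."""
    momentum_cubed = 3.0 * orbitals_per_atom * H**3 / (4.0 * math.pi * atomic_volume_m3)
    return momentum_cubed ** (2.0 / 3.0) / (2.0 * M_E) / EV


def levels_in_width(orbitals_per_atom, top_energy_ev, width_ev):
    """dn = (3/2) * n * dE / E, from differentiating the level count."""
    return 1.5 * orbitals_per_atom * width_ev / top_energy_ev


volume = 11.72e-24 * 1e-6           # atomic volume of iron: cm^3 converted to m^3
e_top = top_level_energy_ev(3, volume)
dn = levels_in_width(3, e_top, 1.34)
print(round(e_top, 1), round(dn, 2))
```

With three orbitals per atom and a band width of 1.34 ev this gives about 23.3 ev and about 0.26, in line with the text.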

The experimental values of the gyromagnetic ratio and the observed 
magnetic moments for paramagnetic compounds of transition metals show 
that the orbital moment is nearly completely quenched. On multiplying 
2.26 by the ratio 1.946/2.000 of the gyromagnetic ratio to the spin value, 
the predicted value of the saturation magnetic moment per atom for iron 
is obtained as 2.20 magnetons, in excellent agreement with the experi- 
mental value, 2.22 magnetons. 
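
The arithmetic of this paragraph is short enough to verify directly. In the sketch below the total of 2.26 is taken as two atomic electrons plus the 0.26 uncoupled conduction electrons; the variable names are illustrative.

```python
atomic_electrons = 2.0          # unpaired atomic electrons per iron atom
uncoupled_electrons = 0.26      # uncoupled conduction electrons per atom
g_ratio = 1.946 / 2.000         # gyromagnetic ratio relative to the spin value

iron_moment = (atomic_electrons + uncoupled_electrons) * g_ratio
print(round(iron_moment, 2))
```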

The calculated energy of interaction of an atomic moment and the Weiss
field (0.26 uncoupled conduction electrons per atom) for magnetic satura-
tion is 0.135 ev, or 3070 cal. mole⁻¹. According to the Weiss theory
the Curie temperature is equal to this energy of interaction divided by
3k, where k is Boltzmann’s constant. The effect of spatial quantization of
the atomic moment, with spin quantum number S, is to introduce the
factor (S + 1)/S; that is, the Curie temperature is equal to nε(S + 1)/3Sk.
For iron, with S = 1, the predicted value for the Curie temperature is 1350°K,
in rough agreement with the experimental value, 1043°K.

Cobalt and Nickel.—The discussion of the magnetic properties of cobalt
and nickel is somewhat more complicated than that for iron, because of
the necessity of giving consideration to the metallic orbital. It has been
pointed out⁸ that on the basis of the resonating-valence-bond theory of
metals a metallic structure, showing large electronic conductivity, can
occur only if many of the atoms in the metal have available one orbital
not ordinarily occupied by an electron; this orbital can act as receptor
for an additional electron, permitting the unsynchronized resonance of the
shared electron pairs in the metal. When the metallic orbital is occupied
by an electron the atom has a negative charge, and another atom in the
metal has a positive charge; it is the motion of these negative charges
and positive charges through the metal that gives rise to electronic con-
duction. (A discussion of the relation between metallic conductivity and
negatively charged and positively charged atoms in the metal on the basis
of the conventional electronic theory of metals has been given by Slater.¹¹)
An atom of cobalt, with six valence electrons and three atomic electrons,
essentially 3d electrons, can have the spins of the atomic electrons parallel
by utilizing all of its orbitals; a metallic orbital is made available only if
two of the atomic electrons have opposed spins. The metal can be de-
scribed as having some atoms with resultant spin quantum number 3/2
and some atoms with spin 1/2. In the same way nickel can
be described as having some atoms, without a metallic orbital, with spin 1
and some atoms, with a metallic orbital, with spin 0. The number of
atoms without a metallic orbital has been estimated⁸ to be 28%. In the
following calculations we use the value 28.5%, which is obtained from the
value 28.57 for the electron number that marks the upper boundary of the
ferromagnetic region, as given by the intercept of the curve of values of
saturation magnetic moment corrected by extrapolation for isoelectronic
sequences to the value 0°K for the Curie temperature (Taglang).¹²

With this assumption and use of the value 1.94 for the g-factor the
average magnetic moment per atom of cobalt is calculated to be 1.52
magnetons. The spectroscopic data for cobalt do not provide very
reliable values for the interaction energies of 3d, 4s, and 4p electrons. The
data for chromium and manganese, when compared with those for iron,
indicate that the interaction energies increase by 5% with unit increase
in the atomic number. We accordingly use the value 0.70 ev for ε, which
leads to 0.20 uncoupled valence electrons per atom. The predicted satura-
tion magnetic moment for cobalt is thus 1.72 magnetons, in good agree-
ment with the experimental value 1.71 magnetons. The value of the Curie
temperature is calculated with use of the average value of the magnetic
moment per atom and the average value of (J + 1)/J to be 1190°K,
which is somewhat smaller than the experimental value 1393°K.
In nickel 28.5% of the atoms, without a metallic orbital, have two
atomic electrons with parallel spins, and magnetic moment 1.94 (corre-
sponding to J = 1, with g = 1.94). The value of the interaction energy ε
is 0.74 ev, and the calculated number of uncoupled valence electrons
per atom is 0.064. The saturation magnetic moment per atom is thus
calculated to be 0.617, in good agreement with the observed value 0.606
magnetons. The calculated value of the Curie temperature, 367°K, is
considerably smaller than the observed value, 631°K.
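
The saturation moments quoted for cobalt and nickel can be reproduced from the stated fractions of atoms. The following sketch reflects one reading of the arithmetic: the average atomic moment is g times the population-weighted spin, and the number of uncoupled valence electrons is added to it directly. The function name is illustrative.

```python
G_FACTOR = 1.94
FRACTION_WITHOUT_METALLIC_ORBITAL = 0.285


def saturation_moment(spin_without, spin_with, uncoupled_electrons):
    """Average atomic moment plus the contribution of uncoupled valence electrons.

    spin_without: spin of atoms lacking a metallic orbital
    spin_with:    spin of atoms possessing a metallic orbital
    """
    f = FRACTION_WITHOUT_METALLIC_ORBITAL
    atomic = G_FACTOR * (f * spin_without + (1.0 - f) * spin_with)
    return atomic + uncoupled_electrons


cobalt = saturation_moment(1.5, 0.5, 0.20)    # spins 3/2 and 1/2
nickel = saturation_moment(1.0, 0.0, 0.064)   # spins 1 and 0
print(round(cobalt, 2), round(nickel, 3))
```

These inputs give about 1.72 magnetons for cobalt and about 0.617 for nickel, matching the values in the text.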

For alloys of iron, cobalt, nickel, and copper the calculated values of 
saturation magnetic moments agree closely with the observed values; 
in particular, the maximum value of about 2.48 magnetons at electron 
number about 26.3 is reproduced by the theory. There is, however, only 
rough agreement between the observed and calculated values of the Curie 
temperature. 

An obvious refinement of the simple theory for cobalt and nickel and 
their alloys can be made which leads to a significant increase in the cal-
culated value of the Curie temperature. The foregoing calculation for
nickel, for example, is based upon the assumption that the uncoupled 
valence electrons spend equal amounts of time on the nickel atoms with 
J = 1 and the nickel atoms with J = 0. However, the stabilizing inter- 
action of the spins of the valence electrons and the parallel atomic moments 
would cause an increase in the wave function for the valence electrons in 
the neighborhood of the atoms with J = 1 and the parallel orientation.
This effect also produces a change in the shape of the curve of saturation 
magnetization as a function of temperature. The details of this refined 
theory will be published later. 

The simple theory accounts for the deviation of the slope of the curve 
of saturation magnetic moment against average electron number from the 
values +1 and −1 by about 10% as resulting from an induced moment
contributed by the conduction electrons. This explanation was suggested
and given empirical support by Forrer¹³ and Taglang.¹²

Gadolinium.—For gadolinium the saturation magnetic moment is 7.12
magnetons and the Curie temperature is 290°K.¹⁴ There is little doubt
that three of the ten outer electrons of gadolinium are valence electrons
and the other seven are atomic electrons occupying 4f orbitals, with
spectroscopic state ⁸S. These seven atomic electrons would give rise to
the magnetic moment 7.00 magnetons, and the small increment may be
attributed to uncoupled valence electrons.

Spectroscopic data are not available to permit an independent estimate 
of the magnitude of the interaction energy of a valence electron and the 
seven atomic electrons. If we assume the value 0.31 ev for this quantity,
ε, the calculated value of the Curie temperature, 290°K, is equal to the
observed value, and the calculated value of the saturation magnetic
moment, 7.19 magnetons, is slightly greater than the observed value, 7.12.
The small disagreement might be explained as resulting from a small
contribution of orbital moment, reducing the g-factor from 2 to 1.98;
the value 1.95 ± 0.03 for the g-factor has been reported from microwave
resonance absorption measurements of gadolinium metal in the para-
magnetic region.¹⁵

Other Interactions.—In the foregoing calculations the direct exchange
interactions of the atomic electrons on neighboring atoms have been
neglected. These exchange interactions are expected in general to tend
to cause the atomic moments on adjacent atoms to align themselves in
the antiparallel orientation. If these interactions and other interactions
(such as those involving an unsymmetrical distribution of the electrons in
an electron-pair bond between adjacent atoms) are less important than the
interactions that stabilize the parallel orientation of the atomic moments 
the substance will be ferromagnetic at low temperatures, and the value of 
the saturation magnetic moment will be that calculated by the above 
methods. However, the neglected interactions, although they are not 
expected to change significantly the saturation magnetic moment, could 
have a large effect on the calculation of the Curie temperature. 

It has been pointed out by Bozorth¹⁶ that not all of the alloys of the iron-
group transition metals have ferromagnetic saturation moments that
correspond to the simple theory of six valence electrons and 0.715 metallic
orbital per atom. It is likely that the exceptional alloys show ferri-
magnetism or antiferromagnetism; that is, that in these alloys the interactions
that stabilize antiparallel orientation of the moments of adjacent atoms
are stronger than those favoring parallel orientation. Until these inter-
actions have been subjected to quantitative treatment the theory of
ferromagnetism, ferrimagnetism, and antiferromagnetism will remain
incomplete.

* Contribution No. 1791. This investigation was carried out in connection with a
program of research on metals supported by contract N6onr-24432 between the California
Institute of Technology and the Office of Naval Research.

¹ Weiss, P., J. de Physique, 6, 667 (1907); 1, 166 (1930).

² Pauling, L., Phys. Rev., 54, 899 (1938).

³ Heisenberg, W., Z. f. Physik, 49, 619 (1928).

⁴ Bloch, F., Z. f. Physik, 57, 545 (1929).

⁵ Pauli, W., Z. f. Physik, 41, 81 (1927).

⁶ Slater, J. C., Phys. Rev., 36, 57 (1930); 49, 537, 931 (1936); 52, 198 (1937).

⁷ Zener, C., Phys. Rev., 81, 440 (1951).

⁸ Pauling, L., Proc. Roy. Soc. London, A 196, 343 (1949).

⁹ Pauling, L., J. Am. Chem. Soc., 53, 1367 (1931).

¹⁰ Moore, C. E., Atomic Energy Levels, Vol. I, June 15, 1949, and Vol. II, August 15,
1952, U. S. Government Printing Office, Washington 25, D. C.

¹¹ Slater, J. C., Phys. Rev., 35, 509 (1930).

¹² Taglang, P., Compt. rend., 229, 704, 994 (1949).

¹³ Forrer, R., J. de Physique, 10, 181 (1939); 1, 74 (1940).

¹⁴ Urbain, G., Weiss, P., and Trombe, F., Compt. rend., 200, 2132 (1935); Trombe, F.,
Ann. de Physique, 7, 385 (1937).

¹⁵ Kip, A. F., Kittel, C., Portis, A. M., Barton, R., and Spedding, F. H., Phys. Rev.,
89, 518 (1953).

¹⁶ Bozorth, R. M., Phys. Rev., 79, 887 (1950).


EQUAL RATIOS OF ENERGIES OF NUCLEAR EXCITED STATES
OF EVEN-EVEN ISOTOPES


By MARTIN G. REDLICH*


PALMER PHYSICAL LABORATORY, PRINCETON UNIVERSITY
Communicated by E. P. Wigner, April 15, 1953


The purpose of this note is to point out a regularity in the measured 
energies of some excited states of isotopic nuclei with even numbers both 
of neutrons and protons (N and Z). The experimental data indicate
that although the addition of two or four neutrons to an even-even nucleus 
changes the energies of its excited states, in most instances it does not alter 
the ratio between the energies of its first and second excited states. 

In order to describe this relation, as well as some similar ones, let Eₘ(A)
and Eₙ(A′) denote the energies of the mth and nth excited states of two
even-even isotopes with mass numbers A and A′ = A + 2 or A + 4, and
further, let

$$r_m(A) = \frac{E_m(A)}{E_1(A)}.$$

In this note, if rₙ(A′) is equal within 3% to rₘ(A), the nth level of isotope
A′ will be said to correspond to the mth level of isotope A. This corre-
spondence will be expressed symbolically as n ~ m. (The correspondence
is not symmetric.) For six of the nine pairs of isotopes on which data are
available, 2 ~ 2. The distribution of correspondences among the re-
maining pairs is: 2 ~ 3 and 3 ~ 4 for one pair; 3 ~ 2 for another. There
is only one pair, Pb*’! and Pb*", for which no correspondence is observed. 
It is possible to conclude from experimental evidence which will be dis-
cussed in a subsequent paragraph that the spins of the second excited
states of these Pb isotopes are not equal. In view of this fact, the absence
of a correspondence between these states might be expected. On a graph
of E(A) vs. E(A′) the correspondence 2 ~ 2 would entail that the points
(E₁(A), E₁(A′)) and (E₂(A), E₂(A′)) lie on a straight line through the origin.
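
The definition of correspondence can be stated compactly in code. The sketch below reads "equal within 3%" as meaning that the quotient of the two ratios differs from unity by at most 0.03, which is one possible interpretation; the level energies are hypothetical and are not taken from table 1.

```python
def level_ratio(levels, n):
    """r_n = E_n / E_1 for a list of excited-state energies [E_1, E_2, ...]."""
    return levels[n - 1] / levels[0]


def corresponds(levels_heavier, n, levels_lighter, m, tolerance=0.03):
    """True if level n of isotope A' corresponds to level m of isotope A (n ~ m)."""
    quotient = level_ratio(levels_heavier, n) / level_ratio(levels_lighter, m)
    return abs(quotient - 1.0) <= tolerance


# Hypothetical excited-state energies in Mev, for illustration only
isotope_a = [1.00, 2.00, 3.00]
isotope_a_prime = [0.80, 1.62, 2.00]

print(corresponds(isotope_a_prime, 2, isotope_a, 2))
print(corresponds(isotope_a_prime, 3, isotope_a, 3))
```

For these hypothetical levels the second levels correspond (2 ~ 2) although the absolute energies differ, while the third levels do not.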

The experimental data are summarized in table 1. The energies of at 
least two excited states of many even-even isotopes can be obtained from 
decay schemes. All elements with two or more such isotopes whose levels 
were found in a recent paper¹ or in the new Table of Isotopes² are listed in


TABLE 1
ENERGIES OF EXCITED STATES OF EVEN-EVEN ISOTOPES
The number below each datum is its estimated uncertainty times 10³. It is enclosed
in parentheses unless based upon estimates made in the original papers.

Et d rs(A) ra(A) 

MENT F(A) 1N ma 42) Ex(A) Es(A) (A) OTHER CORRE 

AND Z d : MEV J ) (A E\(A) 2(Amin SPONDENCES 

Ti ‘ S904 


») . 
990” 2 245 2 33: 0.993 


(2) 25) ) (17) 

874 3.12 é \ 73(96) r(94) 
< = 1.005 
/ (28) 


ro 114)/r3( 110) 
= (). 979 
4 

r( L14)/rg( 110) 
= 0.984 
9g 


3.278 1.009 
(46) (31) 
1.759 
5 
2. 000 0.976 
(42 (19) 
2.479 2.910° 
, . <] 1 
194 32 >. 402 0.974 
68 6 
196 8 358" 922 0.992 
20) 36 71 
204 yy 37 
(2 
208 2.620 
(S) 
“ The energy of this state is either 2, or E, — E;. The nucleus in the second excited
state decays by emission of two γ-rays in cascade; their sequence is not definitely known.
° The position of this level is not definitely established.
“ There may exist another level between the ones denoted 3rd and 4th levels here. 
“ The position of the higher level is not definitely established. 


* The decay scheme is not certain 





column 1 of table 1. States observed only in heavy particle reactions
were not considered because in the many different possible reactions various 
sets of states are excited; in addition, the data are generally less accurate. 
The energy E₁(A) and the ratios rₙ(A) for n = 2, 3, and 4, together with
their estimated uncertainties, are listed in columns 4 to 7. A blank space
in the columns for r₃(A) and r₄(A) or for r₄(A) alone indicates that only
two or three excited states are known from decay schemes. Footnotes
in the table refer to all those levels about whose position in the decay
schemes there remains some doubt. It must be emphasized, however,
that the decay scheme of no nucleus in table 1 is entirely certain. For
example, new levels with energies below that of the highest level listed 
may be discovered. This seems especially likely for Pb?”'. 

It is plain from columns 5 to 7 that correspondences exist. Quotients
of the ratios for two isotopes, rₙ(A)/rₘ(Aₘᵢₙ), are listed in columns 8 and
9 for the four types of correspondence, n ~ m, which appear. Aₘᵢₙ is
the mass number of the lightest isotope.

The data available (reference 1) are consistent with an assignment of
spin and parity 2+ to the first excited states of about half of the nuclei 
in the table; there is no information about the remaining ones. There 
are data bearing on these quantities for higher excited states of both 
isotopes of two of the elements in table 1: 

(1) Ti. γ-γ angular correlation measurements are consistent with
an assumption of spins 2 and 4 for the first and second excited states of 
Ti. Similarly, such measurements are consistent with spins 2 and 4 for 
the first and third (until recently believed to be the second) excited states 
of Ti. One might expect, therefore, that 3 ~ 2 instead of 2 ~ 2, but this 
does not prove to be the case. There is no conclusive evidence, however, 
about the spins and parities of any of the states of Ti®. 

(2) Pb. The second excited state of Pb®* has been assigned spin and 
parity 7—. It is an isomeric state with a half-life of about one hour. 
The mass number of this isotope is classified as ‘“‘probable”’ in reference 2. 
The second excited state of Pb*” was assigned spin 4 from γ-γ angular
correlation measurements. It is not surprising, therefore, that these 
two states do not correspond. 

It is readily seen that the probability for chance occurrence of the
2 ~ 2 correspondences is small. Let Q equal the smaller divided by the
larger of the two numbers r₂(A) and r₂(B) for two even-even nuclei, not
necessarily isotopes, with mass numbers A and B. By definition, 0 <
Q < 1. Q is actually restricted to a smaller interval, however, since the
spacing of the first two levels is qualitatively similar for different even-
even nuclei. The ratios r₂(A) in column 5 of the table, for example, vary
only from 1.222 to 3.420, and values of Q for all pairs lie between 0.36
and 1. In the probability calculation an even smaller interval, 0.5 <
Q < 1, will be taken as the statistical sample space.





Let us select now the quotients Q for the nine pairs of isotopes, with
mass numbers A and Aₘᵢₙ, on which data are available. Six of these
quotients are in the interval 0.97 < Q < 1. The probability that six or
more among nine numbers chosen at random from a sample space of length
0.50 lie in an interval of length 0.03, i.e., in 6% of the total interval, is

$$\sum_{k=6}^{9} \binom{9}{k} (0.06)^{k} (0.94)^{9-k} \approx 3 \cdot 10^{-6}.$$

Acknowledgment.—It is with gratitude that I acknowledge Professor
E. P. Wigner’s advice and interest. 


* Hercules Fellow.
¹ Scharff-Goldhaber, Gertrude, Phys. Rev., 1953 (in press).
² Hollander, Perlman, and Seaborg, Revs. Mod. Phys., 1953 (in press).


THE EFFECT UPON HEART RATE CONDITIONING OF 
RANDOMLY VARYING THE INTERVAL BETWEEN 
CONDITIONED AND UNCONDITIONED STIMULI* 


By P. J. BERSH, W. N. SCHOENFELD, AND J. M. NOTTERMAN
COLUMBIA UNIVERSITY 
Communicated by C. H. Graham, April 15, 1953 


The experiment reported here continues the investigation of variables 
which affect the conditioning and extinction of a heart rate response in hu- 
man beings.'~* No record has been found in the literature of any attempt 
to carry out Pavlovian conditioning by randomly varying the interval be- 
tween conditioned stimulus (CS) and unconditioned stimulus (US), while 
providing reinforcement on a regular schedule. The present experiment 
compares the response strength generated by such a procedure, with that 
produced by regular and irregular reinforcement schedules involving a 
constant CS-US interval. 

Subject and Apparatus.—Thirty male university students, ranging in age
from 20 to 35 years, were used as subjects. After screening for cardiac and
psychiatric history,¹ they were assigned at random to the following three
groups consisting of ten subjects each: (a) Group Cᵣ: constant CS-US in-
terval, regular reinforcement; (b) Group Cᵢ: constant CS-US interval,
irregular reinforcement; and (c) Group Vᵣ: randomly varying CS-US in-
tervals, regular reinforcement.

The apparatus was the same as that employed in the earlier studies and
is described in detail elsewhere.¹ The important features were an electro-
cardiograph; a 750-cps tone, at 20 db, transmitted to the subject through a
headset; and a 30-v. a.c. shock administered through two electrodes
strapped to the palmar and dorsal sides of the subject's left hand.

Procedure.—Each subject was carried through three experimental phases
in a single session: basal (preconditioning); conditioning; and extinction. 
Before the start of the basal phase, all subjects were given the following in- 
structions: 

“This is an experiment to determine your heart rate reaction to tone, and 
your heart rate reaction to electric shock. You will hear a series of tones 
through this headset which will soon be placed over your ears. A little 
later in the experiment, you will receive a series of electric shocks adminis- 
tered to your left hand; you will not be shocked through the electrodes I 
am now placing on your chest and left leg. These electrodes enable us to re- 
cord your heart action. The shocks you will receive will be strong enough for 
you to feel them, but will not be strong enough to do you any harm. Re- 
member that they will just pass through your hand, and not your body. 
Please try not to make any unnecessary movements. I will be keeping a 
continuous record of your heart action while you are in here, and since the 
apparatus is necessarily very sensitive, your moving around too much
would interfere with the recording.” 

During the basal and extinction phases, all subjects were exposed to uni-
form training. The basal phase consisted of 20 tone-alone, and the extinc-
tion phase of 10 tone-alone, trials. On each trial, the subject's heart rate
was recorded continuously from about 30 seconds before tone presentation
to about 45 seconds after termination of the tone. On reinforced condi-
tioning trials, recording lasted from 30 seconds before tone presentation to
30 seconds after shock termination. The durations of tone and shock were
1 second and 6 seconds, respectively, with the interval between tone and
shock as indicated below. Throughout all phases of the experiment, spac- 
ing of trials was randomly varied over a range of from 1-2 minutes. 

The procedure during the conditioning phase differed for the three 
groups as follows: 

(a) Group Cᵣ. This group received 15 tone-shock pairings. On each
trial, shock followed termination of the tone by 9 seconds.

(b) Group Cᵢ. This group also was given 15 reinforced (tone-shock)
trials, with 10 unreinforced (tone-alone) trials interspersed among them.
The first five trials were reinforced.!| During the subsequent 20 trials,
10 unreinforced presentations of the tone were randomly interspersed with
10 reinforced presentations. On each reinforced trial, the tone-shock inter-
val was 9 seconds, as in the case of Group Cᵣ.

(c) Group Vᵣ. This group received 15 consecutive tone-shock trials,
with the tone-shock intervals randomized by blocks of five trials each. 
Within each block, five values of tone-shock interval, namely 4, 6, 9, 12 and
15 seconds, were presented in random sequence. The mean of the random-
ized intervals was 9.2 seconds, a fact which led to the use of the 9-sec. inter- 
val, rather than the 6-sec. interval previously reported,'? on all reinforced 
trials for the other two groups. While insertion of a 3-sec. instead of a 4- 
sec. interval would have resulted in a mean interval of exactly 9 seconds for 
Group Vᵣ, 3 seconds are insufficient to permit at least two heart cycles be-
tween tone and shock in the case of all subjects, while a single cycle does not
provide a reliable measure of heart rate. 
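
The randomization by blocks described for the varying-interval group can be sketched as follows. The shuffling routine and the seed are illustrative assumptions; the text does not say how the random sequences were actually drawn.

```python
import random

INTERVALS_SEC = (4, 6, 9, 12, 15)   # tone-shock intervals used within each block


def blocked_schedule(n_blocks=3, seed=None):
    """Return a list of intervals in which each block of five contains every value once."""
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_blocks):
        block = list(INTERVALS_SEC)
        rng.shuffle(block)            # random order within the block
        schedule.extend(block)
    return schedule


schedule = blocked_schedule(seed=1)
mean_interval = sum(schedule) / len(schedule)
print(schedule, mean_interval)
```

Whatever the order drawn, the 15 trials have a mean interval of 9.2 seconds, since every block contains each of the five values exactly once.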

Results.—In the results reported below, the basic data consist of pretone
and posttone rates in beats per minute (b/m). The pretone rate was in all
cases measured from the two beats immediately preceding tone presenta-
tion. For Groups Cᵣ and Cᵢ on all trials, and for Group Vᵣ during the basal


TABLE 1
GROUP MEAN PRETONE AND GROUP MEAN POSTTONE HEART RATE (B/M) DURING
EACH PHASE OF THE EXPERIMENT

                          CONDITIONING                        EXTINCTION
        BASAL         ALL TRIALS    LAST FIVE     ALL TRIALS    FIRST FIVE    LAST FIVE
GROUP   PRE   POST    PRE   POST    PRE   POST    PRE   POST    PRE   POST    PRE   POST
Cᵣ      75.0  75.1    75.5  71.5    76.4  [?]     75.4  73.8    75.3  74.2    75.5  73.4
        P>0.05        P[?]0.001     P<0.01        P>0.05        P>0.05        P>0.05
Cᵢ      79.4  78.5    80.0  73.6    79.3  70.6    79.3  73.4    78.7  72.9    79.8  73.9
        P>0.05        P[?]0.001     P<0.001       P<0.001       P<0.01        P<0.001
Vᵣ      [?]   [?]     80.7  77.8    81.7  77.0    80.7  76.7    82.0  76.3    79.3  76.9
        P>0.05        P<0.01        P<0.01        P<0.01        P<0.01        0.03>P>0.02

TABLE 2 


TESTS OF SIGNIFICANCE BASED UPON PHASE-BY-PHASE INTERGROUP COMPARISONS 


PHASE
BASAL CONDITIONING EXTINCTION


COMPARISON ALL LAST 5 ALL FIRST 5 LAST 5
Crvs. CG P>0O.05 P<0.05 P<0.05 P<0.06 P<0.08 P<0.8 
Ci vs. V, P>0.05 P<0.01 P<0.05 P>0.05 P>0.05 P<0.02 
V, vs. C, P>0.05 P>0.08 P>0.06 P>0.05 P<0.05 P>0.05 
The underlined values are based upon a one-tail t-test applied without the usual
parametric assumptions.


and extinction phases, the posttone rate was determined from the final two 
beats in the 9-sec. interval following termination of the tone. However, on 
conditioning trials for Group Vᵣ, the posttone rate was based upon the last
two beats before onset of shock. Since shock occurred at intervals ranging 
from 4 to 15 seconds after termination of the tone, the two beats whose 
measurement was recorded as the posttone value varied in temporal po-
sition accordingly. The consequences of this variation in the time of meas-
urement of posttone rate for Group Vᵣ are discussed later.


The phase-by-phase results for the three groups are summarized in table
1. The two values for each phase are the group mean pretone and the
group mean posttone heart rates, respectively, in beats per minute. Each
value was obtained by averaging the pretone and posttone means for indi-
vidual subjects in each group on the indicated trials of the given phase; the
basal values represent trials 11-20 of that phase.¹ All intragroup compari-
sons reported in table 1, or elsewhere in the text, are based upon Pitman’s
variation of a non-parametric test for paired items developed by Fisher.’ 
The P values in table 1 are for the obtained differences between mean pre-
tone and mean posttone rates during each of the phases reported.
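
A randomization test for paired items of the general kind attributed above to Fisher and Pitman can be sketched as follows. This is an illustration of the technique under stated assumptions, not a reconstruction of the authors' computation: the one-tail form, the function name, and the sample differences are all hypothetical.

```python
from itertools import product


def paired_randomization_p(differences):
    """One-tail P value from all sign assignments of the paired differences.

    Under the null hypothesis each pretone-minus-posttone difference is
    equally likely to be positive or negative, so every pattern of signs
    is equally probable. P is the fraction of patterns whose sum is at
    least as large as the observed sum.
    """
    observed = sum(differences)
    magnitudes = [abs(d) for d in differences]
    count = 0
    for signs in product((1, -1), repeat=len(magnitudes)):
        total = sum(s * m for s, m in zip(signs, magnitudes))
        if total >= observed - 1e-12:   # small tolerance for rounding
            count += 1
    return count / 2 ** len(magnitudes)


# Hypothetical heart rate drops (b/m) for ten subjects
drops = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
p_value = paired_randomization_p(drops)
print(p_value)
```

With ten subjects there are 1024 sign patterns, so the smallest attainable one-tail value is 1/1024, which arises when every subject changes in the same direction.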

For intergroup comparisons, a summary of P values based largely upon 
Wilcoxon’s® non-parametric test for unpaired items, is given in table 2. 





FIGURE 1
Acquisition. Mean heart rate drop (beats/minute) in Group Vᵣ as a function of CS-US interval (seconds).
Means are based on all trials of a given interval occurring during conditioning. The
short horizontal lines are the mean heart rate drops (CS-US interval of nine seconds)
for the two constant-interval groups, Cᵣ and Cᵢ.


In each case where a significant result is reported, the group mentioned 
first in the comparison showed the larger heart rate change. Where Wil-
coxon’s test did not yield significance, the more powerful but less conven-
ient t-test was resorted to without the usual parametric assumptions,’ the
underlined figures being one-tail values; where the value reads simply 
“P< 0.05", the given comparison would not be significant on the basis of a 
two-tail test. A table entry of ‘‘P > 0.05” indicates that neither test led to 
a significant result. 

Since the members of Group V, were exposed to five different CS-US intervals
during conditioning, an analysis of heart rate changes was made separately
for each of the intervals (the posttone measure in each case representing
the last two beats before shock), and the mean results by intervals for the
conditioning phase taken as a whole are graphed in figure 1. Each point is
the mean of all determinations for the given interval. In view of the
conditioning history of the members of Group V,, an analysis by intervals
also seemed desirable for extinction, and was actually feasible, since the
absence of shock during extinction made it possible to obtain a measurement,
on every trial, for each of the five intervals previously used during
conditioning. In figure 2 the results for this analysis are plotted for the
entire extinction phase. The inset represents a similar analysis for the
extinction phase divided into halves to correspond with the breakdown for
this phase in table 1. Also included in the figure 2 inset are plots of
analogous data for Group C,. These provide an interval-by-interval comparison
of the amplitude of the acquired heart rate response during each half of
extinction for the two groups previously given shock on every conditioning
trial.

FIGURE 2
[Plot: mean heart rate drop (beats/minute) against CS-US interval (seconds), with inset.]
Extinction. Mean heart rate drop for Group V, as a function of CS-US interval.
Means are based on all trials of a given interval occurring during extinction. The
short horizontal lines are the mean heart rate drops (CS-US interval of nine seconds)
for Groups C; and C), respectively. The inset provides analogous data for Groups V,
and C, for the two halves of extinction. The solid curves are data for Group V,; the
broken curves are data for Group C,. The open circles represent means based on trials
2-6; the solid circles are means based on trials 7-11.
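The interval-by-interval analysis described above amounts to grouping the measurements by CS-US interval and averaging within each group. A minimal sketch follows, assuming that the heart rate drop is the pretone rate minus the posttone rate, both in beats per minute. The record layout and the function name are hypothetical.

```python
from collections import defaultdict


def mean_drop_by_interval(trials):
    """Average heart rate drop for each CS-US interval.

    `trials` is an iterable of (interval_sec, pretone_bpm, posttone_bpm)
    records. The drop for one record is pretone minus posttone, so a
    positive value means the rate fell after the tone.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for interval, pretone, posttone in trials:
        sums[interval] += pretone - posttone
        counts[interval] += 1
    # Return the intervals in ascending order, as on the abscissa of a plot.
    return {k: sums[k] / counts[k] for k in sorted(sums)}
```

Each key of the returned dictionary corresponds to one point on a curve of mean drop against interval.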







Discussion. (1) Acquisition.—The data in table 1 show that conditioning
was successful for Group V,, as well as for the other two groups. The
overall mean drops were 6.4, 4.0 and 2.9 b/m for Groups C,, C, and V,,
respectively. While these values point to some superiority of a regular
reinforcement schedule for a constant interval (Group C,) over a varying
interval (Group V,), this is largely due to the procedure for computing the
posttone rate for Group V,. As noted earlier, this rate was measured from the
two beats before shock onset, which, from trial to trial, varied from 4 to 15
seconds following the end of the tone. For conditioning taken as a whole, the
maximum amplitude of the conditioned response (CR) occurs at the 9-sec.
posttone interval (Fig. 1). When the mean change in heart rate is computed
for Group V, solely on the basis of 9-sec. trials, the value becomes 5.4
b/m, indicating a conditioned response at least equal to that for Group C,.

The strongest conditioning was manifested by Group C,, as evidenced by
the significance tests reported in table 2. The table 2 comparisons between
Groups C, and V, are biased toward significance by the previously noted
method of measuring the Group V, posttone rate; but even when the mean
values for Group V, are obtained for 9-sec. trials alone, they remain
smaller than those for Group C, (P = 0.05).

Of some interest is the influence of the varying interval schedule upon the 
way in which the heart rate varies within each trial as a function of time 
since the end of the tone. As mentioned above, for conditioning as a whole 
the CR tends to reach its maximum amplitude at about 9 seconds, the ap- 
proximate mean of the intervals used. The data also contain some sta- 
tistically non-significant indications that, as conditioning progresses, there 
is a temporal shift in the maximum amplitude of the CR so that it tends to 
occur later and later in the posttone period. 

(2) Extinction.—Comparisons of the relative response strength generated
by the three conditioning procedures are also afforded by the extinction
data. Table 1 shows that the varying interval schedule generates a
significant resistance to extinction. Over the first half of extinction, the
acquired response holds up about as well for Group V, as for Group C,, and
significantly better (vide Table 2) for Group V, than for Group C,.
Furthermore, the mean conditioned response for Group V, remains significant
for the last half of extinction, though, as seen in table 2, there is a
significant difference in favor of Group C, for this part of the extinction
phase. On the other hand, none of the heart rate changes manifested by Group
C, during extinction are significant. Presumably extinction occurs very
rapidly for Group C,, since at the end of conditioning (extinction trial 1)
the mean heart rate change (a drop of 7.5 b/m) is significant (P < 0.01),
while on the subsequent extinction trial, the mean drop in rate (2.9 b/m)
already is not significant. It should be remembered that the heart rate
change on trial 1 of extinction is recorded before the first omission of
shock, and that the effect of such omission is felt only during the second
trial. By comparison with a previous study,¹ it appears that a constant
9-sec. interval is less effective than a constant 6-sec. interval where the
schedule of reinforcement is regular. Apparently no analogous decrease in
effectiveness accompanies the use of a constant 9-sec. instead of a constant
6-sec. interval where the schedule of reinforcement is irregular. This
suggests the possibility of a relationship between schedule of reinforcement
and the duration of a fixed CS-US interval.

For all groups, the extinction values presented in table 2 are based upon
posttone rates taken from the last two beats in the 9-second posttone
interval. When heart rate changes during extinction are plotted as a function
of the intervals employed in the conditioning of Group V,, it becomes clear
(vide Fig. 2 and inset) that the CR is greater for Group V, than for Group C,
through most of the range of intervals represented. This is especially true
for the first half of extinction. During the last half, differences in
amplitude still exist at 4 and 6 seconds, though the progress of extinction
has about equalized the CR for the 9- to 15-sec. intervals.

Cross-comparisons between Groups C, and V, for intervals other than 
the 9-sec. interval already reported in table 2, reveal (Wilcoxon test) that 
for the first half of extinction, the 4-sec. values are also significantly differ- 
ent (0.03 > P > 0.02). None of the cross-comparisons for the last half of 
extinction is significant. 

Further support for the finding that the conditioned response shows more
resistance to extinction for Group V, than for Group C, is afforded by
intragroup comparisons involving the posttone heart rates measured at the 4-,
6-, 12- and 15-sec. intervals. (Similar comparisons for the 9-sec. interval
are found in table 1.) Thus, for the first half of extinction, Group V, shows
a significant heart rate depression at the 4- (P < 0.001), 6- (P < 0.001) and
15- (0.03 > P > 0.02) sec. intervals on the basis of comparisons of subject
mean pretone rates with subject mean posttone rates; for the second half of
extinction, the depression is significant at the 6-sec. interval (0.02 > P >
0.01). If the heart rate changes manifested by Group V, during the first
half of the extinction are compared with the changes during trials 11-20 of
the basal phase, the depression becomes significant for the 12-sec. interval
also (0.02 > P > 0.01). None of these comparisons reveals any significant
heart rate depressions for Group C,.
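The paragraph above does not say which test was applied to the intragroup comparisons of subject mean pretone and posttone rates. One assumption-free option for paired data of this kind, offered here only as an illustrative sketch and not as the authors' method, is a sign-flip randomization test on the per-subject differences. The function name is hypothetical.

```python
from itertools import product


def paired_sign_flip_test(pretone, posttone):
    """One-tail exact sign-flip test for a drop from pretone to posttone.

    Each subject contributes one difference (pretone minus posttone).
    If the tone had no effect, the sign of each difference would be
    arbitrary, so the observed sum of differences is compared with the
    sums obtained under every possible assignment of signs.
    """
    diffs = [a - b for a, b in zip(pretone, posttone)]
    observed = sum(diffs)
    count = 0
    at_least = 0
    for signs in product((1, -1), repeat=len(diffs)):
        s = sum(sign * d for sign, d in zip(signs, diffs))
        count += 1
        if s >= observed - 1e-9:
            at_least += 1
    return at_least / count
```

With n subjects the smallest attainable one-tail value is 1/2^n, so very small groups cannot reach the stricter significance levels under this procedure.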

Summary.—A procedure of randomly varying a tone-shock interval was
successful in conditioning a depression in heart rate in human subjects.
The magnitude of the CR obtained with the varying interval group was at
least equal to that for a constant interval group, where both groups were
regularly reinforced. A constant interval-irregularly reinforced group
reached a stronger CR, however, than did the former two groups. The
resistance to extinction of the CR for the varying interval group was greater
than that for the constant interval-regular reinforcement group, but less
than that for the constant interval-irregular reinforcement group. During
the first half of extinction, the CR held up about equally well for the
varying interval and the constant interval-irregular reinforcement groups.


* This study was supported by funds provided under contract AF 18(600)-69 with the 
USAF School of Aviation Medicine, Randolph Field, Texas. 

1 Notterman, J. M., Schoenfeld, W. N., and Bersh, P. J., “Conditioned Heart Rate
Response in Human Beings During Experimental Anxiety,” J. Comp. Physiol. Psychol.,
45, 1-8 (1952).

2 Notterman, J. M., Schoenfeld, W. N., and Bersh, P. J., “Partial Reinforcement and
Conditioned Heart Rate Response in Human Subjects,” Science, 115, 77-79 (1952).

3 Notterman, J. M., Schoenfeld, W. N., and Bersh, P. J., “A Comparison of Three
Extinction Procedures Following Heart Rate Conditioning,” J. Abnorm. Soc. Psychol.,
47, 674-677 (1952).

4 Jenkins, W. O., and Stanley, J. C., Jr., “Partial Reinforcement: A Review and
Critique,” Psychol. Bull., 47, 193-234 (1950).

5 Pitman, E. J. G., Notes on Non-Parametric Statistical Inference, Dept. Mathematical
Statistics, Columbia Univ., New York, 1948.

6 Wilcoxon, F., “Individual Comparisons by Ranking Methods,” Biometric Bull., 1,
80-83 (1945).

7 Moses, Lincoln E., “Non-Parametric Statistics for Psychological Research,” Psychol.
Bull., 49, 122-148 (1952).




























CONTENTS

BIOCHEMISTRY.—SEROLOGICAL REACTIVITY OF SYNTHETIC POLYGLUCOSES
    By Michael Heidelberger and Alan C. Aisenberg
BIOCHEMISTRY.—A NEW PARTICLE TYPE IN CERTAIN CONNECTIVE TISSUE EXTRACTS
    By Francis O. Schmitt, Jerome Gross, and John H. Highberger
CHEMISTRY.—HYDROGEN BONDED HELICAL CONFIGURATIONS OF THE POLYPEPTIDE CHAIN
GENETICS.—THE EFFECT OF A SUPPRESSOR ON ALLELIC INOSITOLLESS MUTANTS IN
    By Norman H. Giles and C. W. H. Partridge
GENETICS.—THE EFFECT OF PRETREATMENT WITH INFRA-RED RADIATION ON THE
    X-RAY INDUCED SEX-LINKED RECESSIVE LETHAL AND VISIBLE MUTATION
    RATE IN Drosophila Melanogaster
    By William J. Young, Henry T. Yost, Jr., P. T. Ives, and R. Paul Levine
MATHEMATICS.—A THEORY OF N-PERSON GAMES WITH PERFECT INFORMATION
MATHEMATICS.—THE SPINOR CONNECTION IN THE UNIFIED EINSTEIN THEORY OF
    RELATIVITY
MATHEMATICS.—CONNECTIONS BETWEEN EINSTEIN'S TWO UNIFIED THEORIES OF
    RELATIVITY
    By Václav Hlavatý
MATHEMATICS.—ON THE CONTRACTION OF GROUPS AND THEIR REPRESENTATIONS
    By E. Inonu and E. P. Wigner
MATHEMATICS.—ERGODIC PROPERTY OF THE BROWNIAN MOTION PROCESS
    By G. Kallianpur and H. Robbins
MATHEMATICS.—OMNIBUS CHECKING OF THE 61-PLACE TABLE OF DENARY
    LOGARITHMS COMPILED BY PETERS AND STEIN, BY CALLET, AND BY PARKHURST
    By Horace S. Uhler
MATHEMATICS.—SIGNIFICANCE LEVELS FOR A SKEW DISTRIBUTION
    By Edwin B. Wilson
PHYSICS.—GENERALIZED RIEMANN SPACES AND GENERAL RELATIVITY
PHYSICS.—A THEORY OF FERROMAGNETISM
PHYSICS.—EQUAL RATIOS OF ENERGIES OF NUCLEAR EXCITED STATES OF EVEN-
    By Martin G. Redlich
PSYCHOLOGY.—THE EFFECT UPON HEART RATE CONDITIONING OF RANDOMLY
    VARYING THE INTERVAL BETWEEN CONDITIONED AND UNCONDITIONED STIMULI
    By P. J. Bersh, W. N. Schoenfeld, and J. M. Notterman





